ABIOTIC STRESS RESPONSIVE POLYNUCLEOTIDES AND POLYPEPTIDES



Cross Reference to Related Applications 

This application claims priority of U.S. Provisional Patent Application 60/300,112
filed June 22, 2001; U.S. Provisional Patent Application 60/314,662 filed August 24, 2001;
U.S. Provisional Patent Application 60/325,277 filed September 26, 2001; and U.S.
Provisional Patent Application 60/332,132 filed November 21, 2001, each of which is
incorporated by reference in its entirety for all purposes including, but not limited to, all text,
figures, tables, sequence listings, supplemental tables, supplemental figures, appendices and
material submitted on electronic media.

Reference to Material Submitted on Compact Disc 

The sequence listing accompanying this application is contained on compact disc 
pursuant to PCT Rules 89bis and 89ter, and PCT Administrative Rules Part 8. The compact
disc containing the sequence listing has been submitted in triplicate on compact discs labeled
"Copy 1", "Copy 2" and "Copy 3." An additional copy labeled "CFR" has also been 
provided. Each compact disc contains a text file named 60073_Seq_Lst which is 38.8 MB in 
size and which was created on June 20, 2002. 

Field of the Invention

The present invention relates to polynucleotides and polypeptides the expression of 
which is altered in response to abiotic stress and to regulatory elements that are responsive to 
abiotic stress. 


Background 

Recent advances in the speed and ease of large-scale DNA sequencing and the 
computational power of bioinformatic data analysis have permitted scientists to develop 
methods for studying biological processes on a genome-wide scale, giving rise to the field 
known as genomics. Genomics is viewed as a powerful tool for answering fundamental and
complex questions regarding the structure, function and evolution of biological systems.

Structural genomics includes the sequencing and mapping of a genome, where the 
material sequenced may be genomic DNA, or the sequencing of other materials such as 
expressed sequence tags (ESTs) and development of genetic maps based on information from 
visible and/or molecular markers, or physical maps constructed, for example, using yeast
artificial chromosomes (YACs) and/or bacterial artificial chromosomes (BACs). Sequencing
and mapping of the genomes of various organisms provide the tools for comparative
genomics, which can be used to study how genes and genomes are structured and how they
evolved, and may provide insight into the functions of genes and other DNA regions by
studying their parallels in other organisms. 

Functional genomics utilizes the tools of genomics research to elucidate function, 
largely by large-scale measurements of gene expression to create expression profiles of the 
genome, and by gene mutagenesis and/or knock-out to observe the in vivo effects of altered
genes. Functional genomics encompasses gene discovery, gene expression, protein and
nucleic acid structure and function, gene and gene product interactions, and genomic 
approaches to breeding and comparative studies relevant to ecology and evolutionary 
biology. Important tools of functional genomics include: arrays, typically microarrays, of 
nucleotide sequences to simultaneously measure the expression of hundreds or thousands of
genes; differential display, which makes it possible to analyze and compare transcribed genes
to detect differentially expressed mRNAs; transcript profiling or cDNA-amplified fragment 
length polymorphism (cDNA-AFLP) to analyze gene expression; gene knock-out using 
classical mutagenesis or insertional mutagenesis; and bioinformatic tools to analyze 
experimental data and make predictions. 

Comparative functional genomics provides the opportunity to translate the knowledge
gained from one organism, often a model system such as Arabidopsis, to another system such 
as a crop plant, by comparing genome organization, function, and evolution. Comparative 
functional genomics facilitates a greater understanding of: phylogenomics, the study of the 
evolution of genes and gene families using DNA sequence information from organisms
selected at major branch points along the phylogenetic continuum; biochemical pathways, the
orderly flow of materials and information in living organisms; genomic analysis of complex
traits, with the objective to elucidate the mechanisms of multigenic control; and host- 
environment interactions, aimed at understanding the molecular genetic basis for host 
responses to changes in environment on a genomic scale. 

Proteomics is a "post-genomics" set of methodologies that encompasses studies of
protein structure, function, and interaction. Regulomics is the study of gene expression at
the level of genetic network regulatory mechanisms to identify and characterize regulatory 
elements (both cis-acting and trans-acting) that control gene expression. Furthermore, the 
associated genes, including key mediators in diseases and metabolic processes, can be 
identified and the disease-related genetic regulatory circuits can be constructed, facilitating
the discovery of new and better points of therapeutic intervention. Pharmacogenomics refers 
to using genomic information for the design and discovery of new drugs and new therapeutic 
approaches. 

Computational genomics refers both to computational processes for analyzing
sequence and expression data from in vivo experiments and also to computational processes 
for carrying out "in silico" experiments, including electronic hybridization (sequence 
comparison), predictions of gene structure, protein structure, protein function, direct
protein-protein interactions, and higher-order interactions on a systems-wide basis. As such, the term
"computational" can be applied widely, to denote computational comparative genomics,
computational functional genomics, computational comparative functional genomics, 
computational comparative regulatory genomics, and the like. Computational genomics goes 
beyond the analysis of empirical information such as structure, function, and sequence, to 
develop theoretical frameworks using genomic information to create and test new proteins, to
carry out protein design, to evolve genes and genomes in silico, to develop multidimensional
phylogenies, to predict function based on context-dependent information including gene
neighborhood and clusters of orthologous groups of proteins (COGs), and to use sequence 
data with genetic algorithms for modeling surfaces or solving complex problems. 

Integrative approaches to using biological and computational results obtained using
genomics and post-genomics methodologies permit the development of systems-level tools to
understand, model, and predict biological functions. Combinatorial biology can be
practiced by integrating biological information and computational information. Cellomics is 
an example of a systems-level approach to cellular or organismal function in space and over 
time. 

Germplasm improvement has been practiced for millennia to direct the evolution of
plants and animals. Germplasm improvement can be directed to increasing both quantity and 
quality of an agricultural commodity, and may involve enhancing pre-existing traits or 
introducing traits that do not naturally occur in a given organism. Traditional or conventional 
breeding based on selection and crossing to introduce or enhance desired traits has been the
avenue of germplasm improvement prior to the development of methods for genetic
engineering. Genomics makes important contributions to both traditional and molecular 
methods of germplasm improvement. Genomics accelerates the discovery of genes that 
confer key traits and provides maps, markers, and other tools for enhancing traditional 
breeding. Molecular methods of germplasm improvement utilize the products of genomics
research in the design of genetic constructs to achieve a desired purpose. 

Plants and plant products provide the primary sustenance, either directly or indirectly, 
for all animal life, including humans. For the majority of the world's human population and 
for many animals, plants and plant products provide the sole source of nutrition. As the
world population increases, the best hope to prevent widespread famine is to increase the 
quantity and improve the quality of food crops, and to make the crops available to the regions 
of the world most in need of food. 

Throughout history, a continual effort has been made to increase the yield and
nutritional value of food crops. For centuries, plants having desirable characteristics such as
greater resistance to drought conditions or increased size of fruit were crossbred and progeny 
plants exhibiting the desired characteristics were selected and used to produce seed or 
cuttings for propagation. Using such classical genetic methods, plants having, for example, 
greater disease resistance, increased yield, and better flavor have been obtained. 

A new paradigm for germplasm improvement in the cereals is based on the extensive
similarities among the world's food cereals and other grasses in terms of chromosomal gene
content and gene order (Ahn et al., Mol Gen Genet 241:483, 1993). Due to the conservation
of gene order, or synteny, within cereal genomes, a gene on the chromosome of one grass
species can be expected to be present in a predicted location on a specific chromosome of a
number of other grass family species (Bennetzen et al., Proc Natl Acad Sci 95:1975, 1998).
Chromosomes of the various species, most of which differ in chromosome numbers, can be 
arrayed in concentric circles such that a radial line from the central species with the smallest 
genome will pass through regions of similar genic content in each of the other species. This
concept has led the plant genetics community to view the grass family as a single genetic
system. Recognition of these relationships has led to the prospect of gaining sufficient
genomic information from one species to understand much of the genetics of a broad array of 
species. The identification of genes controlling important pathways such as for insect 
resistance, isolation of genes of various types, determination of directional pathways of 
evolution and location of useful genes from exotic sources, decision making on biodiversity
conservation, and many other applications in plant breeding will be easier because of the
heightened understanding of genetic relationships. Although synteny is a well-studied 
phenomenon in the cereal (grass) genomes, similar results are emerging for all groups of 
species, both plant and animal. 
Rice is an important cereal crop for human consumption, with approximately half a
billion tons produced annually. Rice has a genome size that is considerably smaller than those of the
other major cereals, which results in a higher gene density relative to the other cereals. This
smaller genome size and higher gene density make rice an attractive model system for cereal
gene discovery efforts and germplasm improvement through traditional breeding and
molecular methods. Although large-scale sequencing and mapping of the rice genome is
underway, there is currently no complete, assembled, and annotated genome
available for rice. 

Microarray technology is a powerful tool that can be used to identify the presence and level
of expression of a large number of nucleotide sequences in a single assay. A microarray is
formed by linking a large number of discrete polynucleotide sequences, for example, a 
population of polynucleotides representative of a genome of an organism, to a solid support 
such as a microchip, glass slide, or the like, in a defined pattern. By contacting the 
microarray with a nucleic acid sample obtained from a cell of interest, and detecting those
polynucleotides expressed in the cell that hybridize specifically to complementary sequences
on the chip, the pattern formed by the hybridizing polynucleotides allows the identification of 
clusters of nucleotide sequences that are expressed in the cell. Furthermore, where each 
polynucleotide linked to the solid support is known, the identity of the hybridizing sequences
from the nucleic acid sample can be determined.

A strength of microarray technology is that it allows the identification of differential
gene expression simply by comparing patterns of hybridization. For example, by comparing 
the hybridization pattern of nucleic acid molecules obtained from cells of an individual 
suffering from a disease with the nucleic acids obtained from the corresponding cells of a 
healthy individual, genes that are differentially expressed can be identified. The
identification of such differentially expressed genes provides a means to identify new genes,
and can provide insight as to the etiology of a disease. 
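
The comparison of hybridization patterns described above can be sketched in a few lines. The following is a minimal illustration only, assuming that each hybridization pattern has already been reduced to a table of signal intensities keyed by probe; the function name, the probe labels, the two-fold cutoff and the intensity floor are hypothetical choices for illustration and are not taken from this disclosure.

```python
import math


def differentially_expressed(sample, reference, min_fold=2.0, floor=1.0):
    """Return {probe: log2 ratio} for probes whose signal differs by >= min_fold.

    sample and reference map a probe identifier to a signal intensity.
    Intensities below `floor` are raised to `floor` to avoid dividing by zero.
    """
    threshold = math.log2(min_fold)
    hits = {}
    # Only probes measured in both hybridizations can be compared.
    for probe in sample.keys() & reference.keys():
        s = max(sample[probe], floor)
        r = max(reference[probe], floor)
        ratio = math.log2(s / r)
        if abs(ratio) >= threshold:
            hits[probe] = ratio
    return hits


# Hypothetical intensities for three probes in two samples.
test_sample = {"probe_A": 800.0, "probe_B": 210.0, "probe_C": 50.0}
reference_sample = {"probe_A": 100.0, "probe_B": 200.0, "probe_C": 400.0}
hits = differentially_expressed(test_sample, reference_sample)
```

A positive ratio marks a probe with a higher signal in the test sample and a negative ratio marks one with a lower signal. Real microarray data would also need normalization and replicate handling, which this sketch omits.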

Microarray technology has been widely used to identify patterns of gene expression 
associated with particular stages of development or of disease conditions in animal model 
systems, and is being applied to the identification of specific patterns of gene expression in
humans. The recent availability of information for the genomes of plants provides a means to
adapt microarray technology to the study of plant gene expression.

The identification of plant genes involved in conferring a selective advantage on the 
plant to an environmental challenge would facilitate the generation and yield of plants, 
thereby increasing the available food supply to an increasing world population. In addition, 
such knowledge provides a basis for diagnostic tests to identify stresses to which plants are
subjected, allowing implementation of practices to counter the stress. Thus, a need exists to
identify plant genes and nucleotide sequences that are involved in modulating the response of
a plant to changing environmental conditions. The present invention satisfies this need and
provides additional advantages.



Summary 

The present invention relates to clusters or groups of polynucleotides the expression of 
which is altered in response to one or more stress conditions and in particular abiotic stresses.
Such clusters include, for example, plant polynucleotides whose expression is altered in
response to two or more different stress conditions; and plant polynucleotides the expression of
which is altered in response to one stress condition, but not to others. The identification of such 
clusters, using microarray technology, has allowed the identification of plant stress-regulated 
polynucleotides in rice; and homologs and orthologs thereof in other plant species and in
particular cereals. Thus, the invention provides isolated polynucleotide sequences of
stress-regulated nucleotide sequences from cereals, and homologs and orthologs thereof; variants of
such sequences, and nucleotide sequences encoding substantially similar cereal stress-regulated
polypeptides expressed therefrom. Such sequences include, for example, sequences encoding
transcription factors; enzymes, including kinases; and structural proteins, including channel
proteins. Accordingly, the present invention also relates to an isolated polynucleotide
disclosed herein comprising a coding region that encodes a stress-regulated polypeptide from 
a cereal, and portions thereof. Also included is a regulatory element selected from the 
regulatory elements disclosed herein or functional portions thereof, which is involved in 
regulating the response of a cereal to a stress condition, for example, exposure to an abnormal
level of salt, osmotic pressure, temperature or any combination thereof.
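
The two kinds of clusters described above (sequences altered under two or more stress conditions, and sequences altered under one condition but not others) amount to simple set operations. The following is a minimal sketch, assuming the stress-responsive sequences for each condition have already been identified; the function name and the identifiers `g1` to `g5` are hypothetical and do not correspond to any sequence in the sequence listing.

```python
def cluster_by_stress(responsive):
    """Split stress-responsive sequences into multi-stress and stress-specific sets.

    responsive maps a stress name to the set of sequence identifiers whose
    expression is altered under that stress.
    Returns (multi, specific): `multi` holds identifiers altered under two or
    more stresses; `specific` maps each stress to identifiers altered under
    that stress only.
    """
    stresses_per_id = {}
    for stress, ids in responsive.items():
        for seq_id in ids:
            stresses_per_id.setdefault(seq_id, set()).add(stress)

    multi = {s for s, found in stresses_per_id.items() if len(found) >= 2}
    specific = {
        stress: {s for s in ids if len(stresses_per_id[s]) == 1}
        for stress, ids in responsive.items()
    }
    return multi, specific


# Hypothetical identifiers, for illustration only.
responsive = {
    "cold": {"g1", "g2", "g3"},
    "salt": {"g2", "g4"},
    "osmotic": {"g2", "g3", "g5"},
}
multi, specific = cluster_by_stress(responsive)
```

Here `g2` and `g3` fall into the multi-stress cluster, while `g1`, `g4` and `g5` are each specific to a single condition.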

The present invention also relates to a recombinant polynucleotide, which comprises a 
cereal stress-regulated nucleotide sequence disclosed herein or functional portion thereof 
operatively linked to a heterologous nucleotide sequence. In one embodiment, the 
recombinant polynucleotide comprises a cereal stress-regulated regulatory element disclosed
herein operatively linked to a heterologous nucleotide sequence, which is not regulated by the
regulatory element in a naturally occurring plant. The heterologous nucleotide sequence, 
when expressed from the regulatory element, can confer a desirable phenotype to a plant cell 
containing the recombinant polynucleotide. In another embodiment, the recombinant 
polynucleotide comprises a coding region disclosed herein, or a functional portion thereof, of 
a plant stress-regulated polynucleotide operatively linked to a heterologous promoter. The
heterologous promoter provides a means to express the encoded stress-regulated polypeptide 
constitutively, or in a tissue-specific or phase-specific manner. 

One aspect of the present invention provides a method for determining whether a test 
plant, for example a cereal, has been exposed to at least one stress condition, for example an
abiotic stress, comprising determining polynucleotide expression in the test plant to produce 
an expression profile and comparing the expression profile of the test plant to the expression 
profile of at least one reference plant that has been exposed to at least one stress, for example, 
an abiotic stress. In one embodiment the expressed polynucleotides are selected from the
group consisting of any of the polynucleotide sequences contained in the sequence listing. In
another embodiment, the test and reference plants are rice plants and the expressed 
polynucleotides are selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 
8445-8829, 17505-17506 or a functional portion thereof; the group consisting of SEQ ID 
NOs. 2159-3176; the group consisting of SEQ ID NOs. 1964-2158; the group consisting of
SEQ ID NOs. 855-1963; or the group consisting of SEQ ID NOs. 1-854.
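
One way to picture the comparison of a test plant's expression profile with reference profiles is a correlation score against each reference. The following is a minimal sketch under stated assumptions: profiles are given as log ratios keyed by gene, every reference covers every gene in the test profile, and no profile is constant. The use of Pearson correlation, the function names and the example values are illustrative assumptions and are not the comparison method specified in this disclosure.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def best_matching_reference(test_profile, reference_profiles):
    """Score the test profile against each reference; return (best label, scores)."""
    genes = sorted(test_profile)  # fixed gene order shared by all vectors
    test_vec = [test_profile[g] for g in genes]
    scores = {
        label: pearson(test_vec, [ref[g] for g in genes])
        for label, ref in reference_profiles.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical log-ratio profiles, for illustration only.
test_plant = {"g1": 2.9, "g2": -0.2, "g3": 1.8, "g4": 0.1}
references = {
    "cold": {"g1": 3.0, "g2": 0.0, "g3": 2.0, "g4": 0.0},
    "salt": {"g1": 0.0, "g2": 2.5, "g3": -0.1, "g4": 3.0},
}
best, scores = best_matching_reference(test_plant, references)
```

In this example the test profile tracks the cold reference closely and runs opposite to the salt reference, so the cold-exposed reference plant is reported as the best match.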

Another aspect provides an isolated nucleic acid sequence comprising a plant 
nucleotide sequence, at least 10 nucleotides long, that hybridizes under stringent
conditions, or high stringency conditions, to the complement of any one of SEQ ID NOs.
1-4131, 8263-8353, 8445-8829, 17505-17506, or a functional portion thereof, which is
operably linked to a regulatory element or functional portion thereof. In one embodiment,
the regulatory element or functional portion thereof alters transcription of an operatively 
linked nucleic acid sequence in response to an abiotic stress. In one embodiment, the 
regulatory element, or functional portion thereof, is selected from the group consisting of 
SEQ ID NOs. 13374-17504 or a functional portion thereof. 

Also provided are expression cassettes, plants and seeds comprising any of the above
isolated sequences.

Another aspect provides an isolated polynucleotide comprising a plant, for example, a 
cereal, nucleotide sequence containing a coding region for an abiotic stress responsive 
polypeptide, selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353,
8445-8829, 17505-17506, or a functional portion thereof; or a sequence that hybridizes under
stringent conditions or highly stringent conditions, to the complement of any one of SEQ ID
NOs. 1-4131, 8263-8353, 8445-8829, 17505-17506, or a functional portion thereof. In one
embodiment, the sequence is selected from the group consisting of SEQ ID NOs. 1-854. In 
another embodiment, the sequence is selected from the group consisting of SEQ ID NOs.
855-4131, 8445-8829 and 17505-17506. 

A further aspect provides an isolated polynucleotide comprising a rice nucleotide 
sequence containing a coding region for an abiotic stress responsive polypeptide, selected 
from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and
17505-17506, or a functional portion thereof; or a sequence that hybridizes under stringent
conditions or highly stringent conditions, to the complement of any one of SEQ ID NOs.
1-4131, 8263-8353, 8445-8829, and 17505-17506, or a functional portion thereof. Additional
aspects include any of the previously disclosed nucleotide sequences, or functional portions
thereof, wherein the polynucleotide is located in the drought tolerance QTL located on rice
chromosome 3. Another embodiment provides any of the previously disclosed
polynucleotides wherein said polynucleotide is from a genomic region syntenic with a maize
cold tolerance QTL. A further embodiment provides any of the previously disclosed
polynucleotides wherein the polynucleotide is present in the drought tolerance QTL on rice
chromosome 8. Still a further embodiment provides any of the previously disclosed
polynucleotides wherein the polynucleotide encodes a protein comprising a Universal Stress
Protein A domain. 

One specific embodiment comprises an isolated nucleic acid molecule comprising a 
polynucleotide selected from the group consisting of: a) any one of the nucleotide sequences
selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and
17505-17506; b) a functional portion of any of the sequences of a); c) a polynucleotide that is
substantially similar to a sequence of a) or b); d) a sequence of at least 15 nucleotides that
hybridizes under stringent conditions to a polynucleotide of a), b) or c); e) the complement of
any sequence of a), b), c) or d); f) the reverse complement of any sequence of a), b), c) or d); g)
a polynucleotide encoding any polypeptide selected from the group consisting of SEQ ID NOs.
4132-8262, 8354-8444, and 8830-9214; and h) an allelic variant of any of the above. 
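
Items e) and f) above distinguish the complement of a sequence from its reverse complement. The following minimal sketch shows the difference for a DNA sequence written 5' to 3'; the function names and the six-base example are illustrative only, and characters other than A, C, G and T (for example N) are passed through unchanged.

```python
# Translation table pairing each base with its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(seq):
    """Return the base-by-base complement, in the same left-to-right order."""
    return seq.translate(_COMPLEMENT)


def reverse_complement(seq):
    """Return the complementary strand written 5' to 3'."""
    return complement(seq)[::-1]


example = "ATGGCC"  # hypothetical sequence, not taken from the sequence listing
comp = complement(example)
revcomp = reverse_complement(example)
```

For `ATGGCC` the complement is `TACCGG` and the reverse complement is `GGCCAT`.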

Another aspect provides an isolated polypeptide selected from the group consisting of 
SEQ ID NOs. 4132-8262, 8354-8444, and 8830-9214 or a functional portion thereof.
Specific embodiments include, but are not limited to, an isolated polypeptide comprising: a)
any one of the amino acid sequences selected from the group consisting of SEQ ID NOs.
4132-8262, 8354-8444, and 8830-9214; b) a functional portion of a); c) an amino acid
sequence substantially similar to any sequence of a) or b); d) an amino acid sequence of a)-c) 
wherein there has been at least one conservative amino acid substitution; and e) an allelic 
variant of any of a)-d). 
The invention further relates to a method of producing a transgenic plant, which
comprises at least one plant cell that exhibits altered responsiveness to a stress condition. In one
embodiment, the method can be performed by introducing a functional portion of a plant
stress-regulated nucleotide sequence into a plant cell genome, whereby the functional portion of the
plant stress-regulated nucleotide sequence modulates a response of the plant cell to a stress
condition. The functional portion of the plant stress-regulated nucleotide sequence can 
encode a stress-regulated polypeptide or functional peptide portion thereof, wherein 
expression of the stress-regulated polypeptide or functional peptide portion thereof either 
increases the stress tolerance of the transgenic plant, or decreases the stress tolerance of the
transgenic plant. The functional portion of the plant stress-regulated nucleotide sequence
encoding the stress-regulated polypeptide or functional peptide portion thereof can be 
operatively linked to a heterologous promoter. The functional portion of the plant stress- 
regulated nucleotide sequence also can comprise a stress-regulated regulatory element or a 
functional portion thereof, such as a minimal promoter. The stress-regulated regulatory
element can integrate into the plant cell genome in a site-specific manner, whereupon it can
be operatively linked to a heterologous nucleotide sequence, which can be expressed in 
response to a stress condition specific for the regulatory element; or can be a mutant 
regulatory element, which is not responsive to the stress condition, whereby upon integrating 
into the plant cell genome, the mutant regulatory element disrupts an endogenous
stress-regulated regulatory element of a plant stress-regulated nucleotide sequence, thereby altering
the responsiveness of the plant stress-regulated nucleotide sequence to the stress condition. 

One particular aspect provides a method for producing a transgenic plant comprising 
introducing into at least one plant cell a recombinant nucleic acid construct comprising i) any 
one of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506 or the complement
thereof; ii) a polynucleotide substantially similar to any one of SEQ ID NOs. 1-4131,
8263-8353, 8445-8829 and 17505-17506 or the complement thereof; iii) any one of SEQ ID NOs.
13374-17504 or the complement thereof; or iv) a polynucleotide substantially similar to any one
of SEQ ID NOs. 13374-17504 or the complement thereof.
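
Because the SEQ ID NO ranges recur throughout this Summary, a small lookup can help a reader keep them straight. The following sketch encodes only the ranges recited in this Summary (coding polynucleotides, polypeptides, and regulatory elements); the group labels and function name are hypothetical, and identifiers outside the recited ranges simply return `None` rather than a guessed category.

```python
# Inclusive SEQ ID NO ranges as recited in the Summary of this document.
SEQ_ID_GROUPS = {
    "stress-regulated polynucleotide": [
        (1, 4131), (8263, 8353), (8445, 8829), (17505, 17506),
    ],
    "stress-regulated polypeptide": [
        (4132, 8262), (8354, 8444), (8830, 9214),
    ],
    "stress-responsive regulatory element": [
        (13374, 17504),
    ],
}


def classify_seq_id(seq_id):
    """Return the group label for a SEQ ID NO, or None if no recited range covers it."""
    for group, ranges in SEQ_ID_GROUPS.items():
        if any(low <= seq_id <= high for low, high in ranges):
            return group
    return None


labels = [classify_seq_id(n) for n in (854, 4132, 13374, 17505, 10000)]
```

The last identifier in the example, 10000, lies outside every range recited here and therefore yields `None`.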

Further aspects include plants and uniform populations of plants made by the above
methods as well as seeds and progeny from such plants.

In another embodiment, a transgene introduced into a plant cell according to a method
of the invention can encode a polypeptide that regulates expression from an endogenous plant 
stress-regulated nucleotide sequence. Such a polypeptide can be, for example, a 
recombinantly produced polypeptide comprising a zinc finger domain, which is specific for 
the regulatory element, and an effector domain, which can be a repressor domain or an
activator domain. The polynucleotide encoding the recombinant polypeptide can be 
operatively linked to and expressed from a constitutively active, inducible or tissue specific 
or phase specific regulatory element. Expression of the recombinant polypeptide from a 
plant stress-regulated promoter such as any of SEQ ID NOs. 13374-17504 or a functional
portion thereof can be particularly advantageous in that the polypeptide can be coordinately 
expressed with the endogenous plant stress-regulated nucleotide sequences upon exposure to 
a stress condition. The invention also provides transgenic plants produced by a method as
disclosed, as well as a plant cell obtained from such transgenic plant, wherein said plant
cell exhibits altered responsiveness to the stress condition; a seed produced by the transgenic
plant; and a cDNA or genomic DNA library prepared from the transgenic plant, or from a 
plant cell from said transgenic plant, wherein said plant cell exhibits altered responsiveness to 
the stress condition. 

Thus, one aspect provides an isolated nucleic acid comprising a regulatory element
comprising a sequence selected from the group consisting of SEQ ID NOs. 13374-17504 or a
functional portion thereof, wherein said regulatory element alters transcription of an 
operatively linked polynucleotide in a plant cell, for example a cereal, in response to an 
abiotic stress. An additional aspect provides an isolated nucleic acid comprising a regulatory 
element having a sequence substantially similar to a sequence selected from the group
consisting of SEQ ID NOs. 13374-17504 or a functional portion thereof, wherein said
regulatory element alters transcription of an operatively linked polynucleotide in a plant cell
in response to an abiotic stress. 

Further aspects provide an expression cassette comprising as operatively linked 
components, a regulatory element comprising a sequence from the group consisting of SEQ
ID NOs. 13374-17504 or a functional portion thereof, wherein said regulatory element alters
transcription of an operatively linked polynucleotide in a plant cell, for example a cereal, in 
response to an abiotic stress; a coding region; and a termination sequence. Another aspect 
provides an expression cassette comprising as operatively linked components, a regulatory 
element comprising a sequence substantially similar to a sequence selected from the group
consisting of SEQ ID NOs. 13374-17504 or a functional portion thereof, wherein said
regulatory element alters transcription of an operatively linked polynucleotide in a plant cell,
for example a cereal, in response to an abiotic stress; a coding region; and a termination 
sequence. Also provided are host cells and seeds comprising such expression cassettes, 
plants containing such host cells and seeds and progeny of plants containing said host cells. 
In related aspects, the coding regions of the expression cassettes comprise sequences encoding
marker proteins and sequences involved in gene silencing such as antisense sequences, 
double stranded RNAi sequences, a triplexing agent, and sequences comprising dominant 
negative mutations. In additional related aspects, the coding regions comprise sequences 
encoding polypeptides that alter the response of a plant to an abiotic stress.

The present invention also relates to a method of modulating the responsiveness of a
plant cell, for example a cereal cell, to a stress condition. Such a method can be performed, for
example, by introducing a functional portion of a stress-regulated polynucleotide described 
herein into the plant cell, thereby modulating the responsiveness of the plant cell to a stress
condition. Such a method can result in the responsiveness of the plant cell being increased
upon exposure to the stress condition, which, in turn, can result in increased or decreased 
tolerance of the plant cell to a stress condition; or can result in the responsiveness of the plant 
cell to the stress condition being decreased, which, in turn, can result in increased or 
decreased tolerance of the plant cell to a stress condition. In one embodiment, the functional
portion of the plant stress-regulated polynucleotide can integrate into the genome of the plant
cell, thereby modulating the responsiveness of the plant cell to the stress condition. In 
another embodiment, the functional portion of the plant stress-regulated nucleotide sequence 
encodes a stress-regulated polypeptide or functional peptide portion thereof, and can be 
operatively linked to a heterologous promoter. The functional portion of the plant
stress-regulated polynucleotide also can contain a mutation, whereby upon integrating into the plant
cell genome, the polynucleotide disrupts (knocks-out) an endogenous plant stress-regulated 
sequence, thereby modulating the responsiveness of the plant cell to the stress condition. 
Depending on whether the knocked-out gene encodes an adaptive or a maladaptive
stress-regulated polypeptide, the responsiveness of the plant will be modulated accordingly. In still
another embodiment, the functional portion of the plant stress-regulated polynucleotide can
comprise a stress-regulated regulatory element, which can be operatively linked to a 
heterologous nucleotide sequence, the expression of which can modulate the responsiveness 
of the plant cell to a stress condition. Such a heterologous nucleotide sequence can encode, 
for example, a stress-inducible transcription factor such as DREB1A. The heterologous
nucleotide sequence also can encode a polynucleotide that is specific for a plant
stress-regulated nucleotide sequence, for example, an antisense molecule, an RNAi molecule, a
ribozyme, or a triplexing agent, any of which, upon expression in the plant cell, reduces or
inhibits expression of a stress-regulated polypeptide encoded by the nucleotide sequence, 
thereby modulating the responsiveness of the plant cell to a stress condition, for example, an 
abnormal level of cold, osmotic pressure, salinity or any combination thereof. Accordingly,
the invention also relates to a plant cell, for example a cereal plant cell, obtained by such a
method, and to a plant, for example a cereal, comprising such a plant cell.

The present invention also relates to a method of expressing a heterologous nucleotide
sequence in a plant cell, for example a cereal. Such a method can be performed, for example,
by introducing into the plant cell a plant stress-regulated regulatory element operatively 
linked to the heterologous nucleotide sequence, whereby, upon exposure of the plant cell to a 
stress condition, the heterologous nucleotide sequence is expressed in the plant cell. In one 
embodiment, the stress-regulated element is any of the sequences described herein that are
capable of altering transcription of an operatively linked sequence in response to an abiotic
stress, for example, SEQ ID NOs. 13374-17504 or a functional portion thereof. The
heterologous nucleotide sequence can encode a selectable marker, a diagnostic marker, or a 
polypeptide that confers a desirable trait upon the plant cell, for example, a polypeptide that 
improves the nutritional value, digestibility or ornamental value of the plant cell, or a plant
comprising the plant cell.

The present invention further relates to a method of modulating the activity of a 
biological pathway in a plant cell, wherein the pathway involves a stress-regulated 
polypeptide or a non-protein regulatory molecule, for example a ribozyme, such as those 
encoded by sequences contained in any of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and
17505-17506. Such a method can be performed by introducing a functional portion of a plant
stress-regulated polynucleotide into the plant cell, thereby modulating the activity of the 
biological pathway. The method can be performed with respect to a pathway involving any 
of the stress-regulated polypeptides as disclosed herein or encoded by the polynucleotides 
disclosed herein, as well as using homologs or orthologs thereof. Also included are
stress-regulated polypeptides or non-protein regulatory molecules that are encoded by sequences
that are substantially similar to SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and
17505-17506.

The present invention also relates to a method of identifying a polynucleotide that
modulates a stress response in a plant cell, for example a cereal. In one embodiment, the
method comprises determining polynucleotide expression in a plant exposed to at least one
abiotic stress to produce an expression profile and identifying sequences whose expression is
altered at least two-fold compared to plants not exposed to the stress. Such an expression
profile can be obtained, for example, by contacting an array of probes representative of a 
plant cell genome with nucleic acid molecules expressed in a plant cell exposed to the stress, and
detecting a nucleic acid molecule that is expressed at a level different from a level of
expression in the absence of the stress. The method can further comprise introducing the
differentially expressed nucleic acid molecule into a plant cell; and detecting a modulated
response of the genetically modified plant cell to a stress, thereby identifying a
polynucleotide that modulates a stress response in a plant cell. In one embodiment, the
differentially expressed nucleic acid is selected from the group consisting of SEQ ID NOs.
1-4131, 8263-8353, 8445-8829 and 17505-17506. The abiotic stress can be any abiotic stress
such as exposure to an abnormal level of cold, osmotic pressure, salinity or any combination 
thereof. The contacting is under conditions that allow for specific hybridization of a nucleic
acid molecule with a probe having sufficient complementarity, for example, under stringent
or highly stringent hybridization conditions. Expression of the polynucleotide can increase 
or decrease the tolerance of the plant cell to the stress, and the nucleic acid molecule can be 
expressed at a level that is less than or greater than the level of expression in the absence of 
the stress. 

The present invention additionally relates to a method of identifying a stress condition
to which a plant cell, for example a cereal, was exposed by comparing an expression profile
from a test plant suspected of having been exposed to at least one stress to an expression 
profile obtained from a reference plant, preferably of the same species, which has been 
exposed to the suspected stress. Such a method can be performed, for example, by contacting
nucleic acid molecules expressed in the test plant cell with an array of probes representative
of the plant cell genome; detecting a profile of expressed nucleic acid molecules 
characteristic of a stress response, and comparing the expression pattern in the test plant to 
the expression pattern obtained from a reference plant thereby identifying the stress condition 
to which the plant cell was exposed. The contacting is under conditions that allow for
specific hybridization of a nucleic acid molecule with probes having sufficient
complementarity, for example, under stringent hybridization conditions. In one embodiment,
the stress is an abiotic stress. The profile can be characteristic of exposure to a single stress 
condition, for example, an abnormal level of cold, osmotic pressure, or salinity, or can be 
characteristic of exposure to more than one stress condition, for example, cold, increased
osmotic pressure and increased salinity. In one embodiment, the polynucleotides whose
expression is detected are selected from the group consisting of SEQ ID NOs. 1-4131,
8263-8353, 8445-8829 and 17505-17506. In one embodiment, the plant is a rice plant. It is
contemplated that, in any of the above embodiments, the number of polynucleotides
whose expression will be determined will be greater than one. Thus, within the scope of the
invention are embodiments in which the number of different polynucleotides whose
expression is determined for any one plant is at least 10, at least 25, at least 50, at least 100,
at least 250, at least 500, or at least 750.
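
By way of illustration only, the comparison of a test expression profile with reference profiles obtained from plants exposed to known stresses might be sketched as follows. The probe identifiers, the log2 expression ratios, and the use of a Pearson correlation as the similarity measure are all assumptions made for this sketch; the specification does not prescribe a particular scoring method.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length lists (assumes neither is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def identify_stress(test_profile, reference_profiles):
    """Return (stress, correlation) for the reference profile most similar to the test.

    Each profile maps a probe identifier to a log2 expression ratio.
    Only probes present in both profiles are compared.
    """
    best = None
    for stress, reference in reference_profiles.items():
        shared = sorted(set(test_profile) & set(reference))
        r = pearson([test_profile[p] for p in shared],
                    [reference[p] for p in shared])
        if best is None or r > best[1]:
            best = (stress, r)
    return best


# Hypothetical reference profiles (illustrative values, not data from the disclosure).
reference_profiles = {
    "cold": {"p1": 2.0, "p2": -1.0, "p3": 0.1, "p4": 0.0},
    "salinity": {"p1": 0.0, "p2": 0.2, "p3": 1.8, "p4": -1.5},
}
test_profile = {"p1": 1.7, "p2": -0.8, "p3": 0.0, "p4": 0.2}

stress, score = identify_stress(test_profile, reference_profiles)
print(stress)
```

In practice the profile would span many probes (for example at least 10, 25, 50 or more sequences, as contemplated above), and any suitable similarity measure could be substituted for the correlation used here.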

The present invention further relates to a transgenic plant, for example a cereal, which
contains a nucleic acid construct comprising a functional portion of any of the plant
stress-regulated polynucleotides of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506.
In one embodiment, the transgenic plant exhibits altered responsiveness to a stress condition as 
compared to a corresponding reference plant not containing the construct. Such a transgenic 
plant can contain, for example, a construct that disrupts an endogenous stress-regulated
nucleotide sequence in the plant, thereby reducing or inhibiting expression of the gene in
response to a stress condition. Such a knock-out can increase or decrease tolerance of the plant
to a stress condition. The transgene also can comprise a coding sequence of a plant stress- 
regulated nucleotide sequence, such as any of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 
and 17505-17506, which can be operatively linked to a heterologous regulatory element such as
a constitutively active regulatory element, a regulated regulatory element, a tissue-specific or
phase-specific regulatory element, or the like. In another embodiment, the transgenic plant
contains a nucleic acid construct comprising a plant stress-regulated regulatory element, such as
any of SEQ ID NOs. 13374-17504, which can be operatively linked to a heterologous nucleotide
sequence that can encode a polypeptide or functional RNA. Expression of the heterologous
polypeptide can confer a desirable characteristic on the plant, for example, can improve the
nutritional or ornamental value of the transgenic plant. In still another embodiment, the 
transgenic plant contains multiple nucleic acid constructs, which can be multiple copies of the 
same construct, or can be two or more different constructs. 

The present invention also relates to a plant stress-regulated regulatory element, which is
contained in a plant stress-responsive polynucleotide disclosed herein, for example any of SEQ
ID NOs. 13374-17504, or a homolog or ortholog thereof. The invention also provides a method
of identifying an agent, for example a transcription factor, that specifically binds to, or
activates, a plant stress-responsive regulatory element and in particular a stress-responsive
regulatory element from any of SEQ ID NOs. 13374-17504. Such a method can be
performed, for example, by contacting the regulatory element with a plant cell extract, and
identifying polypeptides that specifically bind to the regulatory element. Confirmation that 
the specifically binding polypeptide is a transcription factor can be demonstrated using, for 
example, the stress-regulated regulatory element operably linked to a reporter gene, and 
detecting expression of the reporter gene. Control constructs comprising a regulatory 
element, other than a plant stress-regulated regulatory element, operatively linked to a
reporter molecule can be used to confirm that the transcription factor is specific for the plant 
stress-regulated regulatory element. A polynucleotide encoding such a transcription factor 
also can be obtained. 

The present invention also relates to a method of using a functional portion of a plant
stress-regulated nucleotide sequence disclosed herein to confer a selective advantage on a
plant cell, for example a cereal plant cell. In one embodiment, such a method is performed 
by introducing a plant stress-regulated regulatory element disclosed herein into a plant cell 
such as those described herein, wherein, upon exposure of the plant cell to a stress condition
to which the regulatory element is responsive, a nucleotide sequence operatively linked to the
regulatory element is expressed, thereby conferring a selective advantage to the plant cell. The
operatively linked nucleotide sequence can encode, for example, a transcription factor, the
expression of which induces the further expression of nucleotide sequences involved in a 
stress response, thereby enhancing the response of a plant to the stress condition. In another
embodiment, a coding sequence of a plant stress-regulated sequence described herein is
introduced into the cell, thereby providing the plant with a selective advantage in response to
a stress condition. In still another embodiment, the method results in the knock-out of any of 
the plant stress-regulated sequences described herein in a first population of plants, thereby 
providing a selective advantage to a stress condition in a second population of plants. 

The invention further relates to a method of identifying an agent that modulates the
activity of a stress-regulated regulatory element of a plant, for example a cereal. In a
particular embodiment, a method is provided for identifying an agent that alters the activity
of an abiotic stress responsive regulatory element comprising contacting the agent or a 
composition containing an agent to be tested with at least one abiotic stress responsive
regulatory element, for example selected from the group consisting of SEQ ID NOs. 13374-17504,
or a functional portion thereof, and determining the effect of the agent on the ability
of the regulatory sequence to regulate transcription. In one embodiment, the regulatory 
element can be operatively linked to a heterologous polynucleotide encoding a reporter 
molecule, and an agent that modulates the activity of the stress-regulated regulatory element
can be identified by detecting a change in expression of the reporter molecule due to
contacting the regulatory element with the agent. Such a method can be performed in vitro in
a plant cell-free system, or in a plant cell in culture or in a plant in situ. In another
embodiment, the agent is contacted with a transgenic plant containing an introduced plant 
stress-regulated regulatory element, and an agent that modulates the activity of the regulatory 
element is identified by detecting a phenotypic change in the transgenic plant. The methods
of the invention can be performed in the presence or absence of the stress condition to which
the particular regulatory element is responsive.

Another aspect provides a method for identifying an agent that alters abiotic stress
responsive polynucleotide expression in a plant or plant cell, for example a cereal,
comprising contacting a plant or plant cell with a test agent; subjecting the plant or plant
cell to an abiotic stress or combination of stresses before, during or after contact with the
agent to be tested; obtaining an expression profile of the plant or plant cell and comparing the 
expression profile of the plant or plant cell to an expression profile from a plant or plant cell
not exposed to the abiotic stress or combination of stresses. In one embodiment, the
expression profile comprises expression data for at least one sequence selected from the 
group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506. In 
additional embodiments, the plant or plant cell is a rice plant or rice plant cell. The phrase
"at least one" means that the expression profile discussed above
can contain expression data for more than one of the described sequences, for example at
least 10 sequences, at least 25 sequences, at least 50 sequences, at least 100 sequences, at 
least 250 sequences, at least 500 sequences or at least 750 sequences. 

Still another aspect provides nucleotide probes for the diagnosis of abiotic stress in 
plants, for example cereals, comprising nucleotide sequences of at least 15, 25, 50 or 100
nucleotides, that hybridize under stringent or highly stringent conditions to at least one
sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829,
and 17505-17506. 

An additional aspect provides a method for selecting plants, for example cereals, 
having an altered resistance to abiotic stress comprising obtaining nucleic acid molecules
from the plants to be selected; contacting the nucleic acid molecules with one or more probes
that selectively hybridize under stringent or highly stringent conditions to a nucleic acid 
sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, 
and 17505-17506; detecting the hybridization of the one or more probes to the nucleic acid 
sequences wherein the presence of the hybridization indicates the presence of a gene
associated with altered resistance to abiotic stress; and selecting plants on the basis of the
presence or absence of such hybridization. In one embodiment, marker-assisted selection is 
accomplished in rice. In another embodiment, marker-assisted selection is accomplished in
wheat using one or more probes which selectively hybridize under stringent or highly
stringent conditions to sequences selected from the group consisting of SEQ ID NOs.
9881-11967. In yet another embodiment, marker-assisted selection is accomplished in maize or
corn using one or more probes which selectively hybridize under stringent or highly stringent
conditions to sequences selected from the group consisting of SEQ ID NOs. 11968-13373. In
still another embodiment, marker-assisted selection is accomplished in banana using one or
more probes which selectively hybridize under stringent or highly stringent conditions to
sequences selected from the group consisting of SEQ ID NOs. 9215-9880. In each case,
marker-assisted selection can be accomplished using a probe or probes to a single sequence 
or multiple sequences. If multiple sequences are used they can be used simultaneously or 
sequentially. 

A further aspect provides a method for monitoring a population of plants comprising
providing at least one sentinel plant, for example a cereal, containing a recombinant
polynucleotide comprising a stress responsive regulatory sequence selected from the group 
consisting of SEQ ID NOs. 13374-17504 which is operatively linked to a nucleotide 
sequence encoding a detectable marker, for example a fluorescent protein such as a green
fluorescent protein, a yellow fluorescent protein, a cyan fluorescent protein, a red fluorescent
protein or an enhanced or modified form thereof. 

A further aspect provides a computer readable medium having stored thereon 
computer executable instructions for performing a method comprising receiving data on 
nucleotide sequence expression in a test plant of at least one nucleic acid molecule having at
least 70%, at least 80%, at least 90% or at least 95% sequence identity to a nucleotide
sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829
and 17505-17506; and comparing expression data from said test plant to expression data for 
the same nucleotide sequence or sequences in a plant which has been exposed to at least one 
abiotic stress. 

Yet a further aspect provides a computer readable medium having stored thereon a
data structure comprising sequence data for at least one nucleic acid molecule having at least
70%, at least 80%, at least 90%, or at least 95%, nucleic acid sequence identity to a 
polynucleotide selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 
8445-8829 and 17505-17506, or the complement thereof; and a module receiving the nucleic
acid molecule sequence data which compares the nucleic acid molecule sequence data to at
least one other nucleic acid sequence.
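
As a non-limiting sketch of how such a comparing module might evaluate the recited identity thresholds, the following computes percent identity between two sequences. It assumes the sequences have already been aligned to equal length (gap characters, if any, simply count as mismatches), and the function names and example sequences are hypothetical. A practical system would typically first align the sequences with an established alignment program.

```python
IDENTITY_THRESHOLDS = (70, 80, 90, 95)  # percentages recited in the text above


def percent_identity(aligned_a, aligned_b):
    """Percent of aligned positions at which the two sequences carry the same base."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be pre-aligned, non-empty and of equal length")
    matches = sum(1 for a, b in zip(aligned_a.upper(), aligned_b.upper())
                  if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)


def highest_threshold_met(identity):
    """Return the largest recited threshold that the identity satisfies, or None."""
    met = [t for t in IDENTITY_THRESHOLDS if identity >= t]
    return max(met) if met else None


# Hypothetical ten-base sequences differing at a single position.
identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
print(identity, highest_threshold_met(identity))
```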

An additional aspect provides a monoclonal or polyclonal antibody to any of the 
abiotic stress-related polypeptides disclosed herein, for example, SEQ ID NOs. 4132-8261,
8263-8353 and 8830-9214. 

A further aspect provides a method for identifying a homolog or ortholog of an abiotic
stress responsive polynucleotide comprising determining the nucleotide sequence of a
plurality of isolated polynucleotides to create a set of nucleotide sequences; translating the
nucleotide sequences in the set to derive one or more putative amino acid sequences, based
on one or more of the possible reading frames of the nucleotide sequences and their
complementary sequences; selecting an amino acid sequence of an abiotic stress-responsive
protein and comparing the amino acid sequence of the abiotic stress-responsive protein with
at least one of the putative amino acid sequences; identifying putative amino acid sequences
having homology with at least a region of the amino acid sequence of the abiotic
stress-responsive protein; and correlating putative amino acid sequences having homology to
translated nucleotide sequences. In one embodiment, the amino acid sequence of an abiotic
stress responsive protein is selected from the group consisting of SEQ ID NOs. 4132-8262,
8354-8444, and 8830-9214. In another embodiment, the method further comprises
assembling a plurality of the translated nucleotide sequences that encode the putative amino
acid sequences based on regions of overlap comprising at least 10 base pairs of identical
sequences between two translated nucleotide sequences to form one or more nucleotide
contigs; translating the one or more nucleotide contigs into one or more amino acid contigs;
comparing the one or more amino acid contigs with the amino acid sequence of the abiotic
stress responsive protein to determine homology between the amino acid contig and at least a
region of the abiotic stress responsive protein; and identifying at least one homolog of at least
a region of the abiotic stress responsive protein based on the homology determined.
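
The translating and comparing steps of this method can be illustrated with the following minimal sketch. It translates each nucleotide sequence in all six reading frames using the standard genetic code and flags sequences whose translation shares a stretch of identical residues with a stress-responsive protein. The longest-common-substring score is a deliberately crude stand-in, assumed here for brevity, for the homology searches that would ordinarily be performed with an alignment program; the sequence identifiers, example sequences and the `min_match` cutoff are hypothetical.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons only; unrecognized codons become 'X'."""
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def six_frame_translations(seq):
    """Three forward frames followed by three reverse-complement frames."""
    seq = seq.upper()
    strands = (seq, reverse_complement(seq))
    return [translate(strand[offset:]) for strand in strands for offset in range(3)]


def longest_common_substring(a, b):
    """Length of the longest run of identical residues shared by a and b."""
    best, prev = 0, [0] * (len(b) + 1)
    for ca in a:
        cur = [0] * (len(b) + 1)
        for j, cb in enumerate(b, 1):
            if ca == cb:
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def find_candidates(nucleotide_seqs, stress_protein, min_match=5):
    """Return identifiers whose translation in any frame matches the protein."""
    return [seq_id for seq_id, seq in nucleotide_seqs.items()
            if any(longest_common_substring(frame, stress_protein) >= min_match
                   for frame in six_frame_translations(seq))]


# Hypothetical sequences: est1 encodes MAKLV in its first forward frame.
ests = {"est1": "ATGGCTAAACTGGTT", "est2": "CCCCCCCCCCCCCCC"}
print(find_candidates(ests, "MAKLVDE"))
```

The contig assembly step of the further embodiment (joining sequences that overlap by at least 10 identical base pairs) is not shown here.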

Detailed Description 

The following detailed description is provided to aid those skilled in the art in
practicing the present invention. Even so, this detailed description should not be construed to
unduly limit the present invention as modifications and variations in the embodiments 
discussed herein can be made by those of ordinary skill in the art without departing from the 
scope of the present invention. 

All publications, patents, patent applications, public databases, public database
entries, and other references cited in this application are herein incorporated by reference in
their entirety as if each individual publication, patent, patent application, public database, 
public database entry, or other reference were specifically and individually indicated to be 
incorporated by reference. 

The present invention relates to clusters or groups of polynucleotides and the
polypeptides encoded thereby, the expression of which is altered in response to abiotic stress
conditions and to regulatory elements that are responsive to abiotic stress. In one embodiment,
the invention provides polynucleotides and their associated polypeptides whose expression is
altered at least 2X by abiotic stress. By a two-fold alteration is meant that the change in
expression level can be described by using a multiplier or divisor of at least two. For example,
if the expression level were set at 100 prior to stress exposure, a 2X alteration would result in an
expression level of ≥ 200 or ≤ 50. Abiotic stress conditions, such as a shortage or excess of
solar energy, water and nutrients, and salinity, high and low temperature, or pollution (e.g.,
heavy metals), can have a major impact on plant growth and can significantly reduce the
yield, for example, of cultivars. Under conditions of abiotic stress, the growth of plant cells
is inhibited by arresting the cell cycle in late G1, before DNA synthesis, or at the G2/M
boundary (see Dudits, Plant Cell Division, Portland Press Research Monograph; Francis,
Dudits, and Inze, eds., 1997; chap. 2, page 21; Bergounioux, Protoplasma 142:127-136,
1988). The identification of stress-regulated polynucleotide clusters, using microarray
technology, provides a means to identify plant stress-regulated nucleotide sequences.
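
The two-fold criterion defined above (a multiplier or divisor of at least two) can be expressed as the following short sketch. The function name and the category labels are illustrative assumptions, and expression levels are taken to be positive values on an arbitrary linear scale.

```python
def classify_alteration(baseline, stressed, fold=2.0):
    """Classify a change in expression using a multiplier or divisor of at least `fold`."""
    if baseline <= 0 or stressed < 0:
        raise ValueError("expression levels must be positive")
    if stressed >= baseline * fold:
        return "induced"
    if stressed <= baseline / fold:
        return "repressed"
    return "unchanged"


# With the pre-stress level set at 100, as in the example in the text:
for level in (200, 50, 150):
    print(level, classify_alteration(100, level))
```

Under this sketch a level of 200 or more is classed as induced and a level of 50 or less as repressed, whereas an intermediate level such as 150 does not meet the 2X criterion.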

The polynucleotides and polypeptides disclosed herein are expected to be functional in a 
wide variety of plants and especially cereals. The wide-spread applicability of the sequences 
disclosed is supported by the finding of expressed sequence tags from banana, wheat and maize
or corn having homology to the disclosed polynucleotide sequences. Table 2 shows a list of such
sequences and the SEQ ID NOs of the corresponding banana, wheat and/or maize ESTs. 

As used herein, the term "cluster," when used in reference to stress-regulated 
polynucleotides, refers to polynucleotides that have been selected by drawing Venn diagrams, 
and selecting those nucleotide sequences that are regulated only by a selected stress condition.
The selected stress condition can be a single stress condition, for example, cold, osmotic stress
or salinity stress, or can be a selected combination of stress conditions, for example, cold, 
osmotic stress and salinity stress. In addition, a cluster can be selected based on specifying that 
all of the nucleotide sequences are coordinately regulated, for example, they all start at a low 
level and are induced to a higher level. However, a cluster of saline stress-regulated nucleotide
sequences, for example, that was selected for coordinate regulation from low to high, also can be
decreased in response to cold stress or osmotic stress. By varying the parameters used for 
selecting a cluster of nucleotide sequences, those nucleotide sequences that are expressed in a 
specific manner following a stress can be identified. 
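
The Venn diagram selection described in this definition corresponds to ordinary set operations, as the following sketch suggests. The gene identifiers and their assignment to each stress are invented for illustration and are not drawn from the sequence listing.

```python
# Hypothetical sets of sequences found to be regulated under each stress.
cold = {"g1", "g2", "g3", "g5"}
osmotic = {"g2", "g3", "g4"}
saline = {"g3", "g4", "g5", "g6"}


def regulated_only_by(selected, *others):
    """Sequences regulated by the selected stress and by none of the other stresses."""
    return selected.difference(*others)


cold_only = regulated_only_by(cold, osmotic, saline)      # one region of the Venn diagram
saline_only = regulated_only_by(saline, cold, osmotic)
all_three = cold & osmotic & saline                       # central intersection

print(sorted(cold_only), sorted(saline_only), sorted(all_three))
```

A cluster defined by coordinate regulation (for example, all members induced from a low to a higher level) could be obtained by further filtering any of these sets on the direction of change.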

As used herein in reference to a polynucleotide or nucleic acid sequence, "isolated"
means a polynucleotide or nucleic acid sequence that is free of one or both of the nucleotide 
sequences which flank the polynucleotide in the naturally-occurring genome of the organism 
from which the polynucleotide is derived. The term includes, for example, a polynucleotide
or fragment thereof that is incorporated into a vector or expression cassette; into an
autonomously replicating plasmid or virus; into the genomic DNA of a prokaryote or 
eukaryote; or that exists as a separate molecule independent of other polynucleotides. It also 
includes a recombinant polynucleotide that is part of a hybrid polynucleotide, for example, 
one encoding a polypeptide sequence. 

As used herein "polynucleotide" and "oligonucleotide" are used interchangeably and
refer to a polymeric (2 or more monomers) form of nucleotides of any length, either
ribonucleotides or deoxyribonucleotides. Although nucleotides are usually joined by 
phosphodiester linkages, the term also includes polymeric nucleotides containing neutral 
amide backbone linkages composed of aminoethyl glycine units. This term refers only to the
primary structure of the molecule. Thus, these terms include double- and single-stranded
DNA and RNA as well as DNA/RNA hybrids that may be single-stranded, but are more
typically double-stranded. In addition, the term also refers to triple-stranded regions 
comprising RNA or DNA or both RNA and DNA. The strands in such regions may be from 
the same molecule or from different molecules. The regions may include all or one or more
of the molecules, but more typically involve only a region of some of the molecules. The
terms also include known types of modifications, for example, labels, methylation, "caps", 
substitution of one or more of the naturally occurring nucleotides with an analog, 
internucleotide modifications such as, for example, those with uncharged linkages (e.g.,
methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, etc.), those containing
pendant moieties, such as, for example, proteins (including, e.g., nucleases, toxins,
antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., acridine,
psoralen, etc.), those containing alkylators, those with modified linkages (e.g., alpha anomeric
nucleic acids, etc.), as well as unmodified forms of the polynucleotide. Polynucleotides
include both sense and antisense, or coding and template strands. The terms include naturally
occurring and chemically synthesized molecules.

As used herein, "sequence" means the linear order in which monomers occur in a 
polymer, for example, the order of amino acids in a polypeptide or the order of nucleotides in 
a polynucleotide. 

A "recombinant" nucleic acid is one produced by human intervention in the
nucleotide sequence, typically selection or production. Alternatively, it can be a nucleic acid
made by generating a sequence comprising fusion of two or more fragments which are not
naturally contiguous to each other. Thus, for example, products made by transforming cells
with any unnaturally occurring vector are encompassed, as are nucleic acids comprising
sequences derived using any synthetic oligonucleotide process. Such is often done to replace
a codon with a redundant codon encoding the same or a conservative amino acid, while 
typically introducing or removing a sequence recognition site. Alternatively, it is performed 
to join together nucleic acid segments of desired functions to generate a single genetic entity
comprising a desired combination of functions not found in the commonly available natural
forms. Restriction enzyme recognition sites are often the target of such artificial 
manipulations, but other site specific targets, e.g., promoters, DNA replication sites, 
regulation sequences, control sequences, or other useful features may be incorporated by 
design. 

As used herein, the term "abiotic stress" or "stress" or "stress condition" refers to the
exposure of a plant, plant cell, or the like, to a non-living ("abiotic") physical or chemical
agent or condition that has an adverse effect on metabolism, growth, development, 
propagation and/or survival of the plant (collectively "growth"). A stress can be imposed on 
a plant due, for example, to an environmental factor such as water (e.g., flooding, drought,
dehydration), anaerobic conditions (e.g., a low level of oxygen), abnormal osmotic
conditions, salinity or temperature (e.g., hot/heat, cold, freezing, frost), a deficiency of 
nutrients or exposure to pollutants, or by a hormone, second messenger or other molecule. 
Anaerobic stress, for example, is due to a reduction in oxygen levels (hypoxia or anoxia) 
sufficient to produce a stress response. A flooding stress can be due to prolonged or transient
immersion of a plant, plant part, tissue or isolated cell in a liquid medium such as occurs
during monsoon, wet season, flash flooding or excessive irrigation of plants, or the like. A 
cold stress or heat stress can occur due to a decrease or increase, respectively, in the 
temperature from the optimum range of growth temperatures for a particular plant species. 
Such optimum growth temperature ranges are readily determined or known to those skilled in
the art. Dehydration stress can be induced by the loss of water, reduced turgor, or reduced
water content of a cell, tissue, organ or whole plant. Drought stress can be induced by or
associated with the deprivation of water or reduced supply of water to a cell, tissue, organ or 
organism. Salinity-induced stress (salt-stress) can be associated with or induced by a 
perturbation in the osmotic potential of the intracellular or extracellular environment of a cell. 

As disclosed herein, plant stress-regulated polynucleotides and clusters of stress-
regulated polynucleotides have been identified. Surprisingly, several of the stress-regulated 
polynucleotides previously were known to encode polypeptides having defined cellular 
functions, including roles as transcription factors, enzymes such as kinases, and structural
proteins such as channel proteins, but were not identified as being stress-regulated. The
identification of rice stress-regulated nucleotide sequences has provided a means to identify 
homologous and orthologous nucleotide sequences in other plant species using procedures 
described herein. 

As used herein, the term "substantially similar", when used with respect to a
nucleotide sequence, means a nucleotide sequence corresponding to a reference nucleotide
sequence, wherein the corresponding sequence encodes a polypeptide having substantially 
the same structure and function as the polypeptide encoded by the reference nucleotide 
sequence, e.g., where only changes in amino acids not affecting the polypeptide function 
occur. Desirably, the substantially similar nucleotide sequence encodes the polypeptide
encoded by the reference nucleotide sequence. The percentage of identity between the
substantially similar nucleotide sequence and the reference nucleotide sequence is at least
60%, at least 75%, at least 90%, at least 95%, or at least 99%, including 100%. A nucleotide
sequence is "substantially similar" to a reference nucleotide sequence if it hybridizes to the
reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM
EDTA at 50°C with washing in 2X SSC, 0.1% SDS at 50°C; in 7% sodium dodecyl sulfate
(SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 1X SSC, 0.1% SDS at 50°C
(stringent conditions); in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at
50°C with washing in 0.5X SSC, 0.1% SDS at 50°C (high stringency); in 7% sodium dodecyl
sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at
50°C (very high stringency); or in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM
EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 65°C (extremely high stringency).

As is well known in the art, stringency is related to the Tm of the hybrid formed. The
Tm (melting temperature) of a nucleic acid hybrid is the temperature at which 50% of the
bases are base-paired. For example, if one of the partners in a hybrid is a short oligonucleotide
of approximately 20 bases, 50% of the duplexes are typically strand separated at the Tm. In
this case, the Tm reflects a time-independent equilibrium that depends on the concentration of
oligonucleotide. In contrast, if both strands are longer, the Tm corresponds to a situation in
which the strands are held together in a structure possibly containing alternating duplex and



WO 03/008540 PCT/US02/19668 


denatured regions. In this case, the Tm reflects an intramolecular equilibrium that is
independent of time and polynucleotide concentration.

As is also well known in the art, Tm is dependent on the composition of the
polynucleotide (e.g., length, type of duplex, base composition, and extent of precise base
pairing) and the composition of the solvent (e.g., salt concentration and the presence of
denaturants such as formamide). One equation for the calculation of Tm can be found in
Sambrook et al. (Molecular Cloning, 2nd ed., Cold Spring Harbor Press, 1989) and is:

Tm = 81.5°C + 16.6(log10[Na+]) + 0.41(% G + C) - 0.63(% formamide) - (600/L)

where L is the length of the hybrid in base pairs, the concentration of Na+ is in the range of
0.01 M to 0.4 M and the G + C content is in the range of 30% to 75%. Equations for hybrids
involving RNA can be found in the same reference. Alternative equations can be found in
Davis et al., Basic Methods in Molecular Biology, 2nd ed., Appleton and Lange, 1994, Sec. 6-8.
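
The Tm relationship described above can be illustrated with a short calculation. The following Python sketch is illustrative only; the function name and argument names are hypothetical and are not part of the cited reference. It assumes the salt term is added (16.6 x log10[Na+]), so that the estimated Tm rises with salt concentration; that sign should be checked against the cited reference before the sketch is relied upon. The range checks mirror the stated validity limits for Na+ concentration and G + C content.

```python
import math


def estimate_tm(na_molar, gc_percent, length_bp, formamide_percent=0.0):
    """Estimate the Tm (in degrees C) of a long DNA-DNA hybrid.

    Hypothetical helper: the salt term is assumed to carry a positive
    coefficient, so Tm increases with Na+ concentration.
    """
    if not 0.01 <= na_molar <= 0.4:
        raise ValueError("Na+ concentration must be between 0.01 M and 0.4 M")
    if not 30.0 <= gc_percent <= 75.0:
        raise ValueError("G + C content must be between 30% and 75%")
    if length_bp <= 0:
        raise ValueError("hybrid length must be a positive number of base pairs")
    return (
        81.5
        + 16.6 * math.log10(na_molar)   # salt term (assumed positive sign)
        + 0.41 * gc_percent             # base composition term
        - 0.63 * formamide_percent      # denaturant term
        - 600.0 / length_bp             # length correction
    )


if __name__ == "__main__":
    # 0.1 M Na+, 50% G + C, 600 bp hybrid, no formamide
    print(round(estimate_tm(0.1, 50.0, 600), 1))
```

In this form, adding formamide or shortening the hybrid lowers the estimated Tm, which is consistent with the use of lower salt and higher temperature washes for the more stringent conditions listed earlier.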

Likewise, the term "substantially similar," when used in reference to a polypeptide
sequence, means that an amino acid sequence relative to a reference (query) sequence shares at
least about 65% amino acid sequence identity, at least about 75% amino acid sequence identity,
at least about 85%, at least about 90%, or at least about 95% or greater amino acid sequence
identity. Generally, sequences having an E < 10^-8 are considered to be substantially similar to a
query sequence. Such sequence identity can take into account conservative amino acid changes
that do not substantially affect the function of a polypeptide.

Homology or identity is often measured using sequence analysis software such as the
Sequence Analysis Software Package of the Genetics Computer Group (University of
Wisconsin Biotechnology Center, 1710 University Avenue, Madison, WI 53705). Such
software matches similar sequences by assigning degrees of homology to various deletions,
substitutions and other modifications. The terms "homology" and "identity," when used herein
in the context of two or more nucleic acids or polypeptide sequences, refer to two or more
sequences or subsequences that are the same or have a specified percentage of amino acid
residues or of nucleotides that are the same when compared and aligned for maximum
correspondence over a comparison window or designated region as measured using any number
of sequence comparison algorithms or by manual alignment and visual inspection.

For sequence comparison, typically one sequence acts as a reference sequence, to which 
test sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are entered into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. Default program parameters can be 
used, or alternative parameters can be designated. The sequence comparison algorithm then
calculates the percent sequence identities for the test sequences relative to the reference 
sequence, based on the program parameters. 
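
As an illustration of the percent identity calculation just described, the following Python sketch counts identical positions between two sequences that have already been aligned (gaps shown as "-"), optionally restricted to designated subsequence coordinates. The function name, the gap symbol, and the choice to count every aligned column (including columns with a gap in one sequence) in the denominator are assumptions made for this example; alignment programs differ in how they define the denominator.

```python
def percent_identity(reference_aln, test_aln, window=None):
    """Percent identity of two pre-aligned sequences of equal length.

    window is an optional (start, end) pair of alignment columns
    (0-based, end exclusive) that limits the comparison.
    """
    if len(reference_aln) != len(test_aln):
        raise ValueError("aligned sequences must have the same length")
    start, end = window if window is not None else (0, len(reference_aln))
    matches = 0
    compared = 0
    for ref_char, test_char in zip(reference_aln[start:end], test_aln[start:end]):
        if ref_char == "-" and test_char == "-":
            continue  # skip columns that are gaps in both sequences
        compared += 1
        if ref_char == test_char:
            matches += 1
    if compared == 0:
        raise ValueError("no positions to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    print(percent_identity("ACGTACGT", "ACGTACGA"))          # whole alignment
    print(percent_identity("ACGTACGT", "ACGTACGA", (0, 4)))  # first four columns
```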

The term "comparison window" is used broadly herein to include reference to a segment
of any one of the number of contiguous positions, for example, about 20 to 600 positions, for
example, amino acid or nucleotide positions, usually about 50 to about 200 positions, more
usually about 100 to about 150 positions, in which a sequence may be compared to a reference
sequence of the same number of contiguous positions after the two sequences are optimally
aligned. Methods of alignment of sequence for comparison are well known in the art. Optimal
alignment of sequences for comparison can be conducted, for example, by the local homology
algorithm of Smith and Waterman (Adv. Appl. Math. 2:482, 1981), by the homology alignment
algorithm of Needleman and Wunsch (J. Mol. Biol. 48:443, 1970), by the search for similarity
method of Pearson and Lipman (Proc. Natl. Acad. Sci. USA 85:2444, 1988), each of which is
incorporated herein by reference; by computerized implementations of these algorithms (GAP,
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics
Computer Group, 575 Science Dr., Madison, WI); or by manual alignment and visual
inspection. Other algorithms for determining homology or identity include, for example, in
addition to a BLAST program (Basic Local Alignment Search Tool at the National Center for
Biotechnology Information), ALIGN, AMAS (Analysis of Multiply Aligned Sequences), AMPS
(Protein Multiple Sequence Alignment), ASSET (Aligned Segment Statistical Evaluation
Tool), BANDS, BESTSCOR, BIOSCAN (Biological Sequence Comparative Analysis Node),
BLIMPS (BLocks IMProved Searcher), FASTA, Intervals & Points, BMB, CLUSTAL V,
CLUSTAL W, CONSENSUS, LCONSENSUS, WCONSENSUS, Smith-Waterman
algorithm, DARWIN, Las Vegas algorithm, FNAT (Forced Nucleotide Alignment Tool),
Framealign, Framesearch, DYNAMIC, FILTER, FSAP (Fristensky Sequence Analysis
Package), GAP (Global Alignment Program), GENAL, GIBBS, GenQuest, ISSC (Sensitive
Sequence Comparison), LALIGN (Local Sequence Alignment), LCP (Local Content
Program), MACAW (Multiple Alignment Construction & Analysis Workbench), MAP
(Multiple Alignment Program), MBLKP, MBLKN, PIMA (Pattern-Induced Multi-sequence
Alignment), SAGA (Sequence Alignment by Genetic Algorithm) and WHAT-IF. Such
alignment programs can also be used to screen genome databases to identify polynucleotide
sequences having substantially identical sequences.

A number of genome databases are available for comparison. Several databases 
containing genomic information annotated with some functional information are maintained by 
different organizations, and are accessible via the internet, for example, http://www.tigr.org/tdb;
http://www.genetics.wisc.edu; http://genome-www.stanford.edu/~ball; http://hiv-web.lanl.gov;
http://www.ncbi.nlm.nih.gov; http://www.ebi.ac.uk; http://Pasteur.fr/other/biology; and
http://www.genome.wi.mit.edu.

In particular, the BLAST and BLAST 2.0 algorithms using default parameters are
particularly useful for identifying polynucleotides and polypeptides encompassed within the
present invention (Altschul et al., Nucleic Acids Res. 25:3389-3402, 1997; J. Mol. Biol.
215:403-410, 1990). Software for performing BLAST analyses is publicly available through the
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov). This algorithm
involves first identifying high scoring sequence pairs (HSPs) by identifying short words of
length W in the query sequence, which either match or satisfy some positive-valued threshold
score T when aligned with a word of the same length in a database sequence. T is referred to as
the neighborhood word score threshold (Altschul et al., supra, 1997, 1990). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing them.
The word hits are extended in both directions along each sequence for as far as the cumulative
alignment score can be increased. Cumulative scores are calculated using, for nucleotide
sequences, the parameters M (reward score for a pair of matching residues; always >0) and N
(penalty score for mismatching residues; always <0). For
amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of
the word hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below, due
to the accumulation of one or more negative-scoring residue alignments; or the end of either
sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity
and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults
a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a comparison of both strands.
For amino acid sequences, the BLASTP program uses as defaults a wordlength of 3, an
expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff, Proc.
Natl. Acad. Sci. USA 89:10915, 1992), alignments (B) of 50, expectation (E) of 10, M=5, N=-4,
and a comparison of both strands.
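
The extension step described above can be sketched as follows for nucleotide sequences. This Python example is a simplified illustration, not the BLAST implementation: it extends an exact-match seed in one direction only, without gaps, and stops under the three halting conditions named in the text. The function name, the assumption that the seed is an exact match of length word_len, and the value chosen for the drop-off X are all hypothetical; M=5 and N=-4 are taken from the BLASTN defaults stated above.

```python
def extend_right(query, subject, q_start, s_start, word_len,
                 match=5, mismatch=-4, x_drop=20):
    """Ungapped rightward extension of a seed word hit.

    Returns (best_score, best_length), where best_length counts the seed
    plus the extension that gave the maximum cumulative score.
    """
    score = word_len * match      # seed assumed to be an exact match
    best_score = score
    best_length = word_len
    offset = word_len
    # Halting condition 3: the end of either sequence is reached.
    while q_start + offset < len(query) and s_start + offset < len(subject):
        if query[q_start + offset] == subject[s_start + offset]:
            score += match
        else:
            score += mismatch
        offset += 1
        if score > best_score:
            best_score, best_length = score, offset
        elif best_score - score >= x_drop or score <= 0:
            # Halting conditions 1 and 2: score fell off by X from its
            # maximum, or the cumulative score went to zero or below.
            break
    return best_score, best_length


if __name__ == "__main__":
    q = "ACGTACGTTTTT"
    s = "ACGTACGTAAAA"
    print(extend_right(q, s, 0, 0, word_len=4))
```

Extension to the left is symmetric, and the two best scores (less one copy of the seed score) would be combined to score the full segment pair. A smaller X stops the extension sooner, which is faster but less sensitive.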

The BLAST algorithm also performs a statistical analysis of the similarity between two
sequences (see, for example, Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90:5873, 1993).
One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)),
which provides an indication of the probability by which a match between two nucleotide or
amino acid sequences would occur by chance. For example, a nucleic acid is considered similar
to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid
to the reference nucleic acid is less than about 0.2, less than about 0.01, or less than about 0.001.

In one embodiment, protein and nucleic acid sequence homologies are evaluated
using the Basic Local Alignment Search Tool ("BLAST"). In particular, five specific
BLAST programs are used to perform the following tasks:

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein
sequence database;

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence
database;

(3) BLASTX compares the six-frame conceptual translation products of a query
nucleotide sequence (both strands) against a protein sequence database;

(4) TBLASTN compares a query protein sequence against a nucleotide sequence
database translated in all six reading frames (both strands); and

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence
against the six-frame translations of a nucleotide sequence database.
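
The six-frame conceptual translation used by BLASTX, TBLASTN and TBLASTX can be illustrated with the short Python sketch below. It is an example only and is not taken from the BLAST software: the function names and frame labels are hypothetical, the standard genetic code is assumed, stop codons are written as "*", and any codon containing a character other than A, C, G or T is written as "X".

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons from the start of seq; ignore leftovers."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Map frame labels (+1, +2, +3, -1, -2, -3) to conceptual translations."""
    seq = seq.upper()
    frames = {}
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(strand_seq[offset:])
    return frames


if __name__ == "__main__":
    for label, protein in six_frame_translations("ATGGCCTAA").items():
        print(label, protein)
```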

The BLAST programs identify homologous sequences by identifying similar
segments, which are referred to herein as "high-scoring segment pairs," between a query
amino or nucleic acid sequence and a test sequence which may be obtained from a protein or
nucleic acid sequence database. High-scoring segment pairs can be identified (i.e., aligned) by
means of a scoring matrix, many of which are known in the art. In one embodiment, the
scoring matrix used is the BLOSUM62 matrix (Gonnet et al., Science 256:1443-1445, 1992;
Henikoff and Henikoff, Proteins 17:49-61, 1993). In another embodiment, the PAM or
PAM250 matrices may also be used (Schwartz and Dayhoff, eds., "Matrices for Detecting
Distance Relationships: Atlas of Protein Sequence and Structure" (Washington, National
Biomedical Research Foundation 1978)). BLAST programs are accessible through the U.S.
National Library of Medicine, for example, at www.ncbi.nlm.nih.gov.

The parameters used with the above algorithms may be adapted depending on the 
sequence length and degree of homology studied. In some embodiments, the parameters may be 
the default parameters used by the algorithms in the absence of instructions from the user. 

As disclosed herein, clusters of stress-regulated polynucleotides (and their products),
some of which also have been described as having cellular functions such as enzymatic activity
or roles as transcription factors, are involved in the response of plant cells and in particular plant
cells of cereal crops to various abiotic stresses. As such, the polynucleotides in a cluster may
share common stress-regulated regulatory elements, including, for example, cold-regulated
regulatory elements, salinity-regulated regulatory elements, and osmotic pressure-regulated
regulatory elements, as well as regulatory elements that are responsive to a combination of stress
conditions, but not to any of the individual stress conditions, alone. The identification of such
clusters of polynucleotides thus provides a means to identify the stress-regulated regulatory
elements that control the level of expression of these nucleotide sequences.

As used herein, the term "stress-regulated polynucleotide" or "stress-responsive
polynucleotide" means a polynucleotide sequence of a plant the transcription of which is
altered in response to exposure to a stress condition and/or the regulatory elements involved
in the response. In general, such regulatory elements will be contained within a sequence
including approximately two kilobases upstream of the transcription or translation start site
and two kilobases downstream of the transcription or translation termination site. In the
absence of a stress condition, the stress-regulated nucleotide sequence can normally be
unexpressed in the cells, can be expressed at a basal level, which is induced to a higher level
in response to the stress condition, or can be expressed at a level that is reduced in response
to the stress condition. The coding region of a plant stress-regulated nucleotide sequence
encodes a stress-regulated or stress-responsive polypeptide or a functional non-protein
molecule such as a ribozyme or other functional RNA. A stress-regulated polypeptide can
have an adaptive effect on a plant, thereby allowing the plant to better tolerate stress
conditions; or can have a maladaptive effect, thereby decreasing the ability of the plant to
tolerate the stress conditions.

"Genome" refers to the complete genetic material of an organism, specifically a plant, 
in particular, nuclear genetic material but inclusive of plastid genetic material. 

A "functional RNA" refers to an antisense RNA, ribozyme, or other RNA that is not 
translated. 

The present invention provides an isolated plant stress-regulated regulatory element,
which regulates expression of an operatively linked nucleotide sequence in a plant in
response to a stress condition. A stress-responsive or stress-regulated regulatory element is one
that alters transcription of an operatively linked polynucleotide in response to a stress, for
example, an abiotic stress. Such alteration includes an increase in the transcription or a
decrease in transcription of the operatively linked polynucleotide. As disclosed herein, a
plant stress-regulated regulatory element can be isolated from a polynucleotide sequence of a
plant stress-regulated nucleotide sequence as set forth in the accompanying sequence listing
or a functional portion of said sequence. Methods for identifying and isolating the
stress-regulated regulatory element from the disclosed polynucleotides, or genomic DNA clones
corresponding thereto, are well known in the art. For example, methods of making deletion
constructs or linker-scanner constructs can be used to identify nucleotide sequences that are
responsive to a stress condition. Generally, such constructs include a reporter gene
operatively linked to the sequence to be examined for regulatory activity. By performing
such assays, a plant stress-regulated regulatory element can be defined within a sequence of
about 500 nucleotides or fewer, generally at least about 200 nucleotides or fewer, particularly
about 50 to 100 nucleotides, and more particularly at least about 20 nucleotides or fewer. In
one embodiment, the minimal (core) sequence required for regulating a stress response of a
plant is identified.

As used herein, the term "regulatory element" or "regulatory region" means a
nucleotide sequence that, when operatively linked to a coding region, effects transcription of
the coding region such that a ribonucleic acid (RNA) molecule is transcribed from the coding
region. A regulatory element generally can increase or decrease the amount of transcription
of a nucleotide sequence, for example, a coding sequence, operatively linked to the element
with respect to the level at which the nucleotide sequence would be transcribed absent the
regulatory element. Regulatory elements are well known in the art and include promoters,
enhancers, silencers, inactivated silencer intron sequences, 3'-untranslated or 5'-untranslated
sequences of transcribed sequence, for example, a poly-A signal sequence, or other protein or
RNA stabilizing elements, or other gene expression control elements known to regulate gene
expression or the amount of expression of a gene product. A regulatory element can be
isolated from a naturally occurring genomic DNA sequence or can be synthetic, for example,
a synthetic promoter. In one embodiment, the plant stress-regulated regulatory element is a
plant stress-regulated promoter from a cereal.

Regulatory elements can be constitutively expressed regulatory elements, which
maintain gene expression at a relative level of activity (basal level), or can be regulated
regulatory elements. Constitutively expressed regulatory elements can be expressed in any
cell type, or can be tissue specific, which are expressed only in particular cell types, phase
specific, which are expressed only during particular developmental or growth stages of a
plant cell, or the like. A regulatory element such as a tissue specific or phase specific
regulatory element or an inducible regulatory element useful in constructing a recombinant
polynucleotide or in practicing a method of the invention can be a regulatory element that
generally, in nature, is found in a plant genome. However, the regulatory element also can be
from an organism other than a plant, including, for example, from a plant virus, an animal
virus, or a cell from an animal or other multicellular organism.

One well-known type of regulatory element useful in the practice of the present
invention is the promoter. Useful promoters include, but are not limited to, constitutive,
inducible, temporally regulated, developmentally regulated, spatially-regulated, chemically
regulated, stress-responsive, tissue-specific, viral and synthetic promoters. Promoter
sequences are known to be strong or weak. A strong promoter provides for a high level of
gene expression, whereas a weak promoter provides for a very low level of gene expression.
An inducible promoter is a promoter that provides for the turning on and off of gene
expression in response to an exogenously added agent, or to an environmental or
developmental stimulus. A bacterial promoter such as the Ptac promoter can be induced to
varying levels of gene expression depending on the level of isopropylthiogalactoside added to
the transformed bacterial cells. An isolated promoter sequence that is a strong promoter for
heterologous nucleic acid is advantageous because it provides for a sufficient level of gene
expression to allow for easy detection and selection of transformed cells and provides for a
high level of gene expression when desired.

Within a plant promoter region there are several domains that are necessary for full
function of the promoter. The first of these domains lies immediately upstream of the
structural gene and forms the "core promoter region" containing consensus sequences,
normally 70 base pairs immediately upstream of the gene. The core promoter region contains
the characteristic CAAT and TATA boxes plus surrounding sequences, and represents a
transcription initiation sequence that defines the transcription start point for the structural
gene.

The presence of the core promoter region defines a sequence as being a promoter.
The core promoter region, however, is insufficient to provide full promoter activity. A series
of regulatory sequences upstream of the core constitute the remainder of the promoter. These
regulatory sequences determine expression level, the spatial and temporal pattern of
expression and, for an important subset of promoters, expression under inductive conditions
(regulation by external factors such as light, temperature, chemicals, hormones).

To define a minimal promoter region, a DNA segment representing the promoter
region is removed from the 5' region of the gene of interest and operably linked to the coding
sequence of a marker (reporter) gene by recombinant DNA techniques well known to the art.
The reporter gene is operably linked downstream of the promoter, so that transcripts initiating
at the promoter proceed through the reporter gene. Reporter genes generally encode proteins
that are easily measured, including, but not limited to, chloramphenicol acetyl transferase
(CAT), beta-glucuronidase (GUS), green fluorescent protein (GFP), beta-galactosidase
(beta-GAL), and luciferase.

The construct containing the reporter gene under the control of the promoter is then
introduced into an appropriate cell type by transfection techniques well known to the art. In
one embodiment, cell lysates are prepared and appropriate assays, which are well known in
the art, for the reporter protein are performed. For example, if CAT were the reporter gene of
choice, the lysates from cells transfected with constructs containing CAT under the control of
a promoter under study are mixed with isotopically labeled chloramphenicol and
acetyl-coenzyme A (acetyl-CoA). The CAT enzyme transfers the acetyl group from acetyl-CoA to
the 2- or 3-position of chloramphenicol. The reaction is monitored by thin-layer
chromatography, which separates acetylated chloramphenicol from unreacted material. The
reaction products are then visualized by autoradiography.

The level of enzyme activity corresponds to the amount of enzyme that was made,
which in turn reveals the level of expression from the promoter of interest. This level of
expression can be compared to other promoters to determine the relative strength of the
promoter under study. In order to be sure that the level of expression is determined by the
promoter, rather than by the stability of the mRNA, the level of the reporter mRNA can be
measured directly, such as by Northern blot analysis.

Once activity is detected, mutational and/or deletional analyses may be employed to
determine the minimal region and/or sequences required to initiate transcription. Thus,
sequences can be deleted at the 5' end of the promoter region and/or at the 3' end of the
promoter region, and nucleotide substitutions introduced. These constructs are then
introduced to cells and their activity determined.
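
By way of illustration only, the design of a nested 5' deletion series of the kind described above can be sketched in Python. The function name, the fixed step size and the minimum retained length are hypothetical choices for this example; in practice deletion end points are determined by the cloning or amplification strategy used.

```python
def five_prime_deletion_series(promoter, step, min_length):
    """List (bases_removed, remaining_sequence) for nested 5' deletions.

    The first entry is the full-length promoter; each later entry removes
    a further `step` bases from the 5' end, keeping at least `min_length`.
    """
    if step <= 0:
        raise ValueError("step must be positive")
    if min_length <= 0 or min_length > len(promoter):
        raise ValueError("min_length must be between 1 and the promoter length")
    return [
        (removed, promoter[removed:])
        for removed in range(0, len(promoter) - min_length + 1, step)
    ]


if __name__ == "__main__":
    # Placeholder sequence standing in for a cloned promoter fragment.
    fragment = "GATTACAGATTACAGATTACA"
    for removed, remaining in five_prime_deletion_series(fragment, 7, 7):
        print(removed, remaining)
```

Each remaining sequence would be linked to the reporter gene and assayed as described above; the shortest fragment that retains stress-responsive activity approximates the minimal region.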

The choice of promoter will vary depending on the temporal and spatial requirements
for expression, and also depending on the target species. In some cases, expression in
multiple tissues is desirable, while in others, tissue-specific, e.g., leaf-specific, seed-specific,
petal-specific, anther-specific, or pith-specific, expression is desirable. Although many
promoters from dicotyledons have been shown to be operational in monocotyledons and vice
versa, typically dicotyledonous promoters are selected for expression in dicotyledons, and
monocotyledonous promoters for expression in monocotyledons. There is, however, no
restriction to the provenance of selected promoters. It is sufficient that the promoters are
operational in driving the expression of the nucleotide sequences in the desired cell.

A range of naturally-occurring promoters are known to be operative in plants and
have been used to drive the expression of heterologous (both foreign and endogenous) genes
and nucleotide sequences in plants: for example, the constitutive 35S cauliflower mosaic
virus (CaMV) promoter, the ripening-enhanced tomato polygalacturonase promoter (Schuch
et al., Plant Mol. Biol., 13:303, 1989), the E8 promoter (Diekman & Fischer, EMBO J.,
7:3315, 1988) and the fruit specific 2A1 promoter (Pear et al., Plant Mol. Biol., 13:639,
1989). Many other promoters, e.g., U2 and U5 snRNA promoters from maize, the promoter
from alcohol dehydrogenase, the Z4 promoter from a gene encoding the Z4 22 kD zein
protein, the Z10 promoter from a gene encoding a 10 kD zein protein, a Z27 promoter from a
gene encoding a 27 kD zein protein, the A20 promoter from the gene encoding a 19 kD α-zein
protein, inducible promoters, such as the light inducible promoter derived from the pea rbcS
gene and the actin promoter from rice, e.g., the actin 2 promoter (WO 00/70067); seed
specific promoters, such as the phaseolin promoter from beans, may also be used. The
stress-regulated nucleotide sequences of this invention can also be expressed under the regulation of
promoters that are chemically regulated. This enables the nucleic acid sequence or encoded
polypeptide to be synthesized only when the crop plants are treated with the inducing
chemicals. Chemical induction of gene expression is detailed in EP 0 332 104 and U.S.
Patent 5,614,395.

In some instances it may be desirable to link a constitutive promoter to the
stress-regulated nucleotide sequences of the present invention. Examples of some constitutive
promoters which have been described include the rice actin 1 (Wang et al., Molec. Cell Biol.,
12:3399, 1992; U.S. Patent No. 5,641,876), CaMV 35S (Odell et al., Nature, 313:810, 1985),
CaMV 19S (Lawton et al., Mol. Cell Biol., 7:335, 1987), nos, Adh, sucrose synthase, and the
ubiquitin promoters.

In other situations it may be desirable to limit expression of stress-related sequences
to specific tissues or stages of development. As used herein, the term "tissue specific or
phase specific regulatory element" means a nucleotide sequence that effects transcription in
only one or a few cell types, or only during one or a few stages of the life cycle of a plant, for
example, only for a period of time during a particular stage of growth, development or
differentiation. The terms "tissue specific" and "phase specific" are used together herein in
referring to a regulatory element because a single regulatory element can have characteristics
of both types of regulatory elements. For example, a regulatory element active only during a
particular stage of plant development also can be expressed only in one or a few types of cells
in the plant during the particular stage of development. As such, any attempt to classify such
regulatory elements as tissue specific or as phase specific can be difficult. Accordingly,
unless indicated otherwise, all regulatory elements having the characteristic of a tissue
specific regulatory element, or a phase specific regulatory element, or both are considered
together for purposes of the present invention.

Examples of tissue specific promoters which have been described include the lectin
(Vodkin, Prog. Clin. Biol. Res., 138:87, 1983; Lindstrom et al., Dev. Genet., 11:160, 1990),
corn alcohol dehydrogenase 1 (Vogel et al., EMBO J., 11:157, 1992; Dennis et al., Nuc. Acid
Res., 12:3983, 1984), corn light harvesting complex (Bansal et al., Proc. Natl. Acad. Sci.
USA, 89:3654, 1992), corn heat shock protein (Odell et al., Nature, 313:810, 1985), pea
small subunit RuBP carboxylase (Poulsen et al., Mol. Gen. Genet., 205:193, 1986), Ti
plasmid mannopine synthase (Langridge et al., Proc. Natl. Acad. Sci. USA, 86:3219, 1989),
Ti plasmid nopaline synthase (Langridge et al., Proc. Natl. Acad. Sci. USA, 86:3219, 1989),
petunia chalcone isomerase (vanTunen et al., EMBO J., 7:1257, 1988), bean glycine rich
protein 1 (Keller et al., Genes Dev., 3:1639, 1989), truncated CaMV 35S (Odell et al., Nature,
313:810, 1985), potato patatin (Wenzler et al., Plant Molec. Biol., 13:347, 1989), root cell
(Yamamoto et al., Nuc. Acid Res., 18:7449, 1990), maize zein (Reina et al., Nuc. Acids Res.,
18:6425-26, 1990; Kriz et al., Mol. Gen. Genet., 207:90, 1987; Wandelt et al., Nuc. Acids
Res., 17:2354, 1989; Langridge et al., Cell, 34:1015, 1983; Reina et al., Nuc. Acids Res.,
18:6425-26, 1990), globulin-1 (Belanger et al., Genetics, 129:863, 1991), α-tubulin, cab
(Sullivan et al., Mol. Gen. Genet., 215:431, 1989), PEPCase (Hudspeth & Grula, Plant
Molec. Biol., 12:579, 1989), R gene complex-associated promoters (Chandler et al., Plant
Cell, 1:1175, 1989), histone, and chalcone synthase promoters (Franken et al., EMBO J.,
10:2605, 1991). Tissue specific enhancers are described in Fromm et al., Bio/Technology,
8:833, 1989.

Several other tissue-specific regulated genes and/or promoters have been reported in
plants. These include genes encoding the seed storage proteins (such as napin, cruciferin,
beta-conglycinin, and phaseolin), zein or oil body proteins (such as oleosin), or genes
involved in fatty acid biosynthesis (including acyl carrier protein, stearoyl-ACP desaturase,
fatty acid desaturases (fad 2-1)), and other genes expressed during embryo development
(such as Bce4, see, for example, EP 255378 and Kridl et al., Seed Sci. Res., 1:209, 1991).
Particularly useful for seed-specific expression is the pea vicilin promoter (Czako et al., Mol.
Gen. Genet., 235:33, 1992). (See also U.S. Pat. No. 5,625,136, herein incorporated by
reference.) Other useful promoters for expression in mature leaves are those that are
switched on at the onset of senescence, such as the SAG promoter from Arabidopsis (Gan et
al., Science, 270:1986, 1995).

A class of fruit-specific promoters expressed at or during anthesis through fruit
development, at least until the beginning of ripening, is discussed in U.S. 4,943,674. cDNA
clones that are preferentially expressed in cotton fiber have been isolated (John et al., Proc.
Natl. Acad. Sci. USA, 89:5769, 1992). cDNA clones from tomato displaying differential
expression during fruit development have been isolated and characterized (Mansson et al.,
Mol. Gen. Genet., 200:356, 1985; Slater et al., Plant Molec. Biol., 5:137, 1985). The promoter for
the polygalacturonase gene is active in fruit ripening. The polygalacturonase gene is described in
U.S. Patent No. 4,535,060, U.S. Patent No. 4,769,061, U.S. Patent No. 4,801,590, and U.S.
Patent No. 5,107,065, which disclosures are incorporated herein by reference.

Other examples of tissue-specific promoters include those that direct expression in
leaf cells following damage to the leaf (for example, from chewing insects), in tubers (for
example, patatin gene promoter), and in fiber cells (an example of a developmentally-regulated
fiber cell protein is E6 (John et al., Proc. Natl. Acad. Sci. USA, 89:5769, 1992)).
The E6 gene is most active in fiber, although low levels of transcripts are found in leaf, ovule
and flower.

Additional tissue specific or phase specific regulatory elements include, for example, 
the AGL8/FRUITFULL regulatory element, which is activated upon floral induction (Hempel 
et al., Development 124:3845-3853, 1997); root specific regulatory elements such as the 
regulatory elements from the RCP1 gene and the LRP1 gene (Tsugeki and Fedoroff, Proc. 
Natl. Acad. Sci. USA 96:12941-12946, 1999; Smith and Fedoroff, Plant Cell 7:735, 1995); 
flower specific regulatory elements such as the regulatory elements from the LEAFY gene 
and the APETALA1 gene (Blazquez et al., Development 124:3835-3844, 1997; Hempel et al., 
supra, 1997); seed specific regulatory elements such as the regulatory element from the 
oleosin gene (Plant et al., Plant Mol. Biol. 25:193-205, 1994); and dehiscence zone specific 
regulatory elements. Additional tissue specific or phase specific regulatory elements include 
the Zm13 promoter, which is a pollen specific promoter (Hamilton et al., Plant Mol. Biol. 
18:211-218, 1992); the UNUSUAL FLORAL ORGANS (UFO) promoter, which is active in 
apical shoot meristem; the promoter active in shoot meristems (Atanassova et al., Plant J. 
2:291, 1992); the cdc2a promoter and cyc07 promoter (see, for example, Ito et al., Plant Mol. 
Biol. 24:863, 1994; Martinez et al., Proc. Natl. Acad. Sci. USA 89:7360, 1992; Medford et 
al., Plant Cell 3:359, 1991; Terada et al., Plant J. 3:241, 1993; Wissenbach et al., Plant J. 
4:411, 1993); the promoter of the APETALA3 gene, which is active in floral meristems (Jack 
et al., Cell 76:703, 1994; Hempel et al., supra, 1997); a promoter of an agamous-like (AGL) 
family member, for example, AGL8, which is active in shoot meristem upon the transition to 
flowering (Hempel et al., supra, 1997); floral abscission zone promoters; L1-specific 
promoters; and the like. 

The tissue-specificity of some "tissue-specific" promoters may not be absolute and 
may be tested by one skilled in the art using the diphtheria toxin sequence. One can also 
achieve tissue-specific expression with "leaky" expression by a combination of different 
tissue-specific promoters (Beals et al., Plant Cell 9:1527, 1997). Other tissue-specific 
promoters can be isolated by one skilled in the art (see U.S. 5,589,379). Several inducible 
promoters ("gene switches") have been reported, many of which are described in the reviews 
by Gatz (Curr. Opin. Biotech. 7:168, 1996) and Gatz (Ann. Rev. Plant Physiol. Plant Mol. 
Biol., 48:89, 1997). These include the tetracycline repressor system, the Lac repressor system, 
copper-inducible systems, salicylate-inducible systems (such as the PR1a system), and 
glucocorticoid- (Aoyama et al., Plant J., 11:605, 1997) and ecdysone-inducible systems. 
Also included are the benzene sulphonamide- (U.S. Patent No. 5,364,780) and alcohol- (WO 
97/06269 and WO 97/06268) inducible systems and glutathione S-transferase promoters. 

In some instances it might be desirable to inhibit expression of a native DNA 
sequence within a plant's tissues to achieve a desired phenotype. In this case, such inhibition 
might be accomplished with transformation of the plant to comprise a constitutive, 
tissue-independent promoter operably linked to an antisense nucleotide sequence, such that 
constitutive expression of the antisense sequence produces an RNA transcript that interferes 
with translation of the mRNA of the native DNA sequence. 

Inducible regulatory elements also are useful for purposes of the present invention. 
As used herein, the term "inducible regulatory element" means a regulatory element that, 
when exposed to an inducing agent, effects an increased level of transcription of a nucleotide 
sequence to which it is operatively linked as compared to the level of transcription, if any, in 
the absence of an inducing agent. Inducible regulatory elements can be those that have no 
basal or constitutive activity and only effect transcription upon exposure to an inducing 
agent, or those that effect a basal or constitutive level of transcription, which is increased 
upon exposure to an inducing agent. Inducible regulatory elements that effect a basal or 
constitutive level of expression generally are useful in a method or composition of the 
invention where the induced level of transcription is substantially greater than the basal or 
constitutive level of expression, for example, at least about two-fold greater, or at least about 
five-fold greater. Particularly useful inducible regulatory elements do not have a basal or 
constitutive activity, or increase the level of transcription at least about ten-fold greater than a 
basal or constitutive level of transcription associated with the regulatory element. 
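
By way of illustration only, the fold-induction tiers described above can be expressed as a 
short calculation. The following Python sketch is not part of the disclosed methods; the 
function names, the use of arbitrary transcript-abundance units, and the treatment of the 
"about" two-fold, five-fold and ten-fold levels as hard cutoffs are all assumptions made for 
the example. 

```python
from typing import Optional


def fold_induction(basal: float, induced: float) -> Optional[float]:
    """Return induced/basal, or None when the element has no basal activity."""
    if basal < 0 or induced < 0:
        raise ValueError("transcription levels must be non-negative")
    if basal == 0:
        return None  # the ratio is undefined without basal transcription
    return induced / basal


def classify_element(basal: float, induced: float) -> str:
    """Place an element in the tiers named in the text (hard cutoffs are an assumption)."""
    fold = fold_induction(basal, induced)
    if fold is None:
        # No basal activity: any transcription seen after induction qualifies.
        return "particularly useful" if induced > 0 else "not induced"
    if fold >= 10:
        return "particularly useful"
    if fold >= 5:
        return "useful (at least about five-fold)"
    if fold >= 2:
        return "useful (at least about two-fold)"
    return "below two-fold"
```

For example, an element measured at 1.0 unit without the inducing agent and 6.0 units with it 
would fall in the five-fold tier under these assumed cutoffs. 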

Inducible promoters that have been described include the ABA- and turgor-inducible 
promoters, the promoter of the auxin-binding protein gene (Schwob et al., Plant J., 4:423, 
1993), the UDP glucose flavonoid glycosyl-transferase gene promoter (Ralston et al., 
Genetics, 119:185, 1988), the MPI proteinase inhibitor promoter (Cordero et al., Plant J., 
6:141, 1994), and the glyceraldehyde-3-phosphate dehydrogenase gene promoter (Kohler et 
al., Plant Molec. Biol., 29:1293, 1995; Quigley et al., J. Mol. Evol. 29:412, 1989; Martinez 
et al., J. Mol. Biol. 208:551, 1989). 

The term "inducing agent" is used to refer to a chemical, biological or physical agent 
or environmental condition that effects transcription from an inducible regulatory element. In 
response to exposure to an inducing agent, transcription from the inducible regulatory 
element generally is initiated de novo or is increased above a basal or constitutive level of 
expression. Such induction can be identified using the methods disclosed herein, including 
detecting an increased level of RNA transcribed from a nucleotide sequence operatively 
linked to the regulatory element, increased expression of a polypeptide encoded by the 
nucleotide sequence, or a phenotype conferred by expression of the encoded polypeptide. 

An inducing agent useful in a method of the invention is selected based on the 
particular inducible regulatory element. For example, the inducible regulatory element can 
be a metallothionein regulatory element, a copper inducible regulatory element or a 
tetracycline inducible regulatory element, the transcription from which can be effected in 
response to metal ions, copper or tetracycline, respectively (Furst et al., Cell 55:705-717, 
1988; Mett et al., Proc. Natl. Acad. Sci. USA 90:4567-4571, 1993; Gatz et al., Plant J. 
2:397-404, 1992; Roder et al., Mol. Gen. Genet. 243:32-38, 1994). The inducible regulatory 
element also can be an ecdysone regulatory element or a glucocorticoid regulatory element, 
the transcription from which can be effected in response to ecdysone or other steroid 
(Christopherson et al., Proc. Natl. Acad. Sci. USA 89:6314-6318, 1992; Schena et al., Proc. 
Natl. Acad. Sci. USA 88:10421-10425, 1991). In addition, the regulatory element can be a 
cold responsive regulatory element or a heat shock regulatory element, the transcription of 
which can be effected in response to exposure to cold or heat, respectively (Takahashi et al., 
Plant Physiol. 99:383-390, 1992). Additional regulatory elements useful in the methods or 
compositions of the invention include, for example, the spinach nitrite reductase gene 
regulatory element (Back et al., Plant Mol. Biol. 17:9, 1991); a light inducible regulatory 
element (Feinbaum et al., Mol. Gen. Genet. 226:449, 1991; Lam and Chua, Science 248:471, 
1990); a plant hormone inducible regulatory element (Yamaguchi-Shinozaki et al., Plant Mol. 
Biol. 15:905, 1990; Kares et al., Plant Mol. Biol. 15:225, 1990); and the like. 

An inducible regulatory element also can be a plant stress-regulated regulatory 
element of the invention. In addition to the known stress conditions that specifically induce 
or repress expression from such elements, the present invention provides methods of 
identifying agents that mimic a stress condition. Accordingly, such stress mimics are 
considered inducing or repressing agents with respect to a plant stress-regulated regulatory 
element. In addition, a recombinant polypeptide comprising a zinc finger domain, which is 
specific for the regulatory element, and an effector domain, particularly an activator, can be 
useful as an inducing agent for a plant stress-regulated regulatory element. Furthermore, 
such a recombinant polypeptide provides the advantage that the effector domain can be a 
repressor domain, thereby providing a repressing agent, which decreases expression from the 
regulatory element. In addition, use of such a method of modulating expression of an 
endogenous plant stress-regulated nucleotide sequence provides the advantage that the 
polynucleotide encoding the recombinant polypeptide can be introduced into cells of the 
plant, thus providing a transgenic plant that can be regulated coordinately with the 
endogenous plant stress-regulated nucleotide sequence upon exposure to a stress condition. 
A polynucleotide encoding such a recombinant polypeptide can be operatively linked to and 
expressed from a constitutively active, inducible or tissue specific or phase specific 
regulatory element. 

In one embodiment, the promoter may be a gamma zein promoter, an oleosin ole16 
promoter, a globulin1 promoter, an actin 1 promoter, an actin c1 promoter, a sucrose 
synthetase promoter, an INOPS promoter, an EXM5 promoter, a globulin2 promoter, a b-32, 
ADPG-pyrophosphorylase promoter, an Ltp1 promoter, an Ltp2 promoter, an oleosin ole17 
promoter, an oleosin ole18 promoter, an actin 2 promoter, a pollen-specific protein promoter, 
a pollen-specific pectate lyase promoter, an anther-specific protein promoter, an 
anther-specific gene RTS2 promoter, a pollen-specific gene promoter, a tapetum-specific gene 
promoter, a tapetum-specific gene RAB24 promoter, an anthranilate synthase alpha subunit 
promoter, an alpha zein promoter, an anthranilate synthase beta subunit promoter, a 
dihydrodipicolinate synthase promoter, a Thi1 promoter, an alcohol dehydrogenase promoter, 
a cab binding protein promoter, an H3C4 promoter, a RUBISCO SS starch branching enzyme 
promoter, an ACCase promoter, an actin3 promoter, an actin7 promoter, a regulatory protein 
GF14-12 promoter, a ribosomal protein L9 promoter, a cellulose biosynthetic enzyme 
promoter, an S-adenosyl-L-homocysteine hydrolase promoter, a superoxide dismutase 
promoter, a C-kinase receptor promoter, a phosphoglycerate mutase promoter, a root-specific 
RCc3 mRNA promoter, a glucose-6 phosphate isomerase promoter, a pyrophosphate-fructose 
6-phosphate-1-phosphotransferase promoter, a ubiquitin promoter, a beta-ketoacyl-ACP 
synthase promoter, a 33 kDa photosystem II promoter, an oxygen evolving protein 
promoter, a 69 kDa vacuolar ATPase subunit promoter, a metallothionein-like protein 
promoter, a glyceraldehyde-3-phosphate dehydrogenase promoter, an ABA- and 
ripening-inducible-like protein promoter, a phenylalanine ammonia lyase promoter, an adenosine 
triphosphatase S-adenosyl-L-homocysteine hydrolase promoter, an alpha-tubulin promoter, a cab 
promoter, a PEPCase promoter, an R gene promoter, a lectin promoter, a light harvesting 
complex promoter, a heat shock protein promoter, a chalcone synthase promoter, a zein 
promoter, a globulin-1 promoter, an ABA promoter, an auxin-binding protein promoter, a 
UDP glucose flavonoid glycosyl-transferase gene promoter, an NTI promoter, an actin 
promoter, an opaque 2 promoter, a b70 promoter, an oleosin promoter, a CaMV 35S 
promoter, a CaMV 19S promoter, a histone promoter, a turgor-inducible promoter, a pea 
small subunit RuBP carboxylase promoter, a Ti plasmid mannopine synthase promoter, a Ti 
plasmid nopaline synthase promoter, a petunia chalcone isomerase promoter, a bean glycine 
rich protein I promoter, a CaMV 35S transcript promoter, a potato patatin promoter, or an 
S-E9 small subunit RuBP carboxylase promoter. 

In addition to promoters, a variety of 5' and 3' transcriptional regulatory sequences are 
also available for use in the present invention. Transcriptional terminators are responsible for 
the termination of transcription and correct mRNA polyadenylation. The 3' nontranslated 
regulatory DNA sequence usually includes from about 50 to about 1,000, typically about 100 
to about 1,000, nucleotide base pairs and contains plant transcriptional and translational 
termination sequences. Appropriate transcriptional terminators and those which are known to 
function in plants include the CaMV 35S terminator, the tml terminator, the nopaline 
synthase terminator, the pea rbcS E9 terminator, the terminator for the T7 transcript from the 
octopine synthase gene of Agrobacterium tumefaciens, and the 3' end of the protease inhibitor 
I or II genes from potato or tomato, although other 3' elements known to those of skill in the 
art can also be employed. Alternatively, one also could use a gamma coixin, oleosin 3 or 
other terminator from the genus Coix. 

Suitable 3' elements include those from the nopaline synthase gene of Agrobacterium 
tumefaciens (Bevan et al., Nature, 304:184, 1983), the terminator for the T7 transcript from 
the octopine synthase gene of Agrobacterium tumefaciens, and the 3' end of the protease 
inhibitor I or II genes from potato or tomato. 

As the DNA sequence between the transcription initiation site and the start of the 
coding sequence, i.e., the untranslated leader sequence, can influence gene expression, one 
may also wish to employ a particular leader sequence. Suitable leader sequences are 
contemplated to include those that comprise sequences predicted to direct optimum 
expression of the attached sequence, i.e., to include a consensus leader sequence that may 
increase or maintain mRNA stability and prevent inappropriate initiation of translation. The 
choice of such sequences will be known to those of skill in the art in light of the present 
disclosure. Sequences that are derived from genes that are highly expressed in plants are 
desirable. 

Other sequences that have been found to enhance gene expression in transgenic plants 
include intron sequences (e.g., from Adh1, bronze1, actin1, actin 2 (WO 00/760067), or the 
sucrose synthase intron) and viral leader sequences (e.g., from TMV, MCMV and AMV). 
For example, a number of non-translated leader sequences derived from viruses are known to 
enhance expression. Specifically, leader sequences from Tobacco Mosaic Virus (TMV), 
Maize Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown 
to be effective in enhancing expression (e.g., Gallie et al., Nuc. Acids Res., 15:8693, 1987; 
Skuzeski et al., Plant Mol. Biol., 15:65, 1990). Other leaders known in the art include, but 
are not limited to: Picornavirus leaders, for example, EMCV leader (Encephalomyocarditis 5' 
noncoding region) (Elroy-Stein et al., Proc. Natl. Acad. Sci. USA, 86:6126, 1989); Potyvirus 
leaders, for example, TEV leader (Tobacco Etch Virus); MDMV leader (Maize Dwarf 
Mosaic Virus); Human immunoglobulin heavy-chain binding protein (BiP) leader (Macejak 
et al., Nature, 353:90, 1991); untranslated leader from the coat protein mRNA of alfalfa 
mosaic virus (AMV RNA 4) (Jobling et al., Nature, 325:622, 1987); Tobacco mosaic virus 
leader (TMV) (Gallie et al., Molecular Biology of RNA, 237, 1989); and Maize Chlorotic 
Mottle Virus leader (MCMV) (Lommel et al., Virology, 81:382, 1991). See also, 
Della-Cioppa et al., Plant Physiol. 84:965, 1987. 

Regulatory elements such as Adh intron 1 (Callis et al., Genes Devel. 1:1183, 1987), 
sucrose synthase intron (Vasil et al., Plant Physiol., 91:1575, 1989) or TMV omega element 
(Gallie et al., Molecular Biology of RNA, 237, 1989), may further be included where 
desired. 

Examples of enhancers include elements from the CaMV 35S promoter, octopine 
synthase genes (Ellis et al., EMBO J., 6:3203, 1987), the rice actin 1 gene, the maize alcohol 
dehydrogenase gene (Callis et al., Genes Devel. 1:1183, 1987), the maize shrunken 1 gene 
(Vasil et al., Plant Physiol. 91:1575, 1989), TMV Omega element (Gallie et al., Molecular 
Biology of RNA, 1989) and promoters from non-plant eukaryotes (e.g., yeast; Ma et al., 
Nature, 334:631, 1988). 

Vectors for use in accordance with the present invention may be constructed to 
include the ocs enhancer element. This element was first identified as a 16 bp palindromic 
enhancer from the octopine synthase (ocs) gene of Agrobacterium (Ellis et al., EMBO J., 6:3203, 
1987), and is present in at least 10 other promoters (Bouchez et al., EMBO J., 8:4197, 1989). 
The use of an enhancer element, such as the ocs element and particularly multiple copies of 
the element, will act to increase the level of transcription from adjacent promoters when 
applied in the context of monocot transformation. 

The methods of the invention provide genetically modified plant cells, which can 
contain, for example, a coding region, or functional portion thereof, of a plant 
stress-regulated polynucleotide operatively linked to a heterologous inducible regulatory element; 
or a plant stress-regulated regulatory element operatively linked to a heterologous nucleotide 
sequence encoding a polypeptide of interest. In such a plant, the expression from the 
inducible regulatory element can be effected by exposing the plant cells to an inducing agent 
in any of numerous ways depending, for example, on the inducible regulatory element and 
the inducing agent. For example, where the inducible regulatory element is a cold responsive 
regulatory element present in the cells of a transgenic plant, the plant can be exposed to cold 
conditions, which can be produced artificially, for example, by placing the plant in a 
thermostatically controlled room, or naturally, for example, by planting the plant in an 
environment characterized, at least in part, by attaining temperatures sufficient to induce 
transcription from the promoter but not so cold as to kill the plants. By examining the 
phenotype of such transgenic plants, those plants that ectopically express a gene product that 
confers increased resistance of the plant to cold can be identified. Similarly, a transgenic 
plant containing a metallothionein promoter can be exposed to metal ions such as cadmium 
or copper by watering the plants with a solution containing the inducing metal ions, or can be 
planted in soil that is contaminated with a level of such metal ions that is toxic to most plants. 
The phenotype of surviving plants can be observed, and those expressing desirable traits can 
be selected. 

As used herein, the term "phenotype" refers to a physically detectable characteristic. 
A phenotype can be identified visually by inspecting the physical appearance of a plant 
following exposure, for example, to increased osmotic conditions; can be identified using an 
assay to detect a product produced due to expression of a reporter gene, for example, an 
RNA molecule, a polypeptide such as an enzyme, or other detectable signal such as disclosed 
herein; or by using any appropriate tool useful for identifying a phenotype of a plant, for 
example, a microscope, a fluorescence activated cell sorter, or the like. 

A transgenic plant containing an inducible regulatory element such as a steroid 
inducible regulatory element can be exposed to a steroid by watering the plants with a 
solution containing the steroid. The use of an inducible regulatory element that is induced 
upon exposure to a chemical or biological inducing agent that can be placed in solution or 
suspension in an aqueous medium can be particularly useful because the inducing agent can 
be applied conveniently to a relatively large crop of transgenic plants containing the inducible 
regulatory element, for example, through a watering system or by spraying the inducing 
agent over the field. As such, inducible regulatory elements that are responsive to an 
environmental inducing agent, for example, cold; heat; metal ions or other potentially toxic 
agents such as pesticides, which can contaminate a soil; or the like; or inducible regulatory 
elements that are regulated by inducing agents that conveniently can be applied to plants, can 
be particularly useful in a method or composition of the invention, and allow the 
identification and selection of plants that express desirable traits and survive and grow in 
environments that otherwise would not support growth of the plants. 

For purposes of modulating the responsiveness of a plant to a stress condition, it can 
be useful to introduce a modified plant stress-regulated regulatory element into a plant. Such 
a modified regulatory element can have any desirable characteristic, for example, it can be 
inducible to a greater level than the corresponding wild-type promoter, or it can be 
inactivated such that, upon exposure to a stress, there is little or no induction of expression of 
a nucleotide sequence operatively linked to the mutant element. A plant stress-regulated 
regulatory element can be modified by incorporating random mutations using, for example, 
in vitro recombination or DNA shuffling (Stemmer et al., Nature 370:389-391, 1994; U.S. 
Patent No. 5,605,793). Using such a method, millions of mutant copies of the 
polynucleotide, for example, stress-regulated regulatory element, can be produced based on 
the original nucleotide sequence, and variants with improved properties, such as increased 
inducibility, can be recovered. 

A mutation method such as DNA shuffling encompasses forming a mutagenized 
double-stranded polynucleotide from a template double-stranded polynucleotide, wherein the 
template double-stranded polynucleotide has been cleaved into double stranded random 
fragments of a desired size, and comprises the steps of adding to the resultant population of 
double-stranded random fragments one or more single or double stranded oligonucleotides, 
wherein the oligonucleotides comprise an area of identity and an area of heterology to the 
double stranded template polynucleotide; denaturing the resultant mixture of double stranded 
random fragments and oligonucleotides into single stranded fragments; incubating the 
resultant population of single stranded fragments with a polymerase under conditions that 
result in the annealing of the single stranded fragments at the areas of identity to form pairs 
of annealed fragments, the areas of identity being sufficient for one member of a pair to 
prime replication of the other, thereby forming a mutagenized double-stranded 
polynucleotide; and repeating the second and third steps for at least two further cycles, 
wherein the resultant mixture in the second step of a further cycle includes the mutagenized 
double-stranded polynucleotide from the third step of the previous cycle, and the further 
cycle forms a further mutagenized double-stranded polynucleotide. Typically, the 
concentration of a single species of double stranded random fragment in the population of 
double stranded random fragments is less than 1% by weight of the total DNA. In addition, 
the template double stranded polynucleotide can comprise at least about 100 species of 
polynucleotides. The size of the double stranded random fragments can be from about 5 base 
pairs to 5 kilobase pairs. In a further embodiment, the fourth step of the method comprises 
repeating the second and the third steps for at least 10 cycles. 
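
The cycle of fragmenting, annealing at areas of identity, and extending can be pictured with 
a toy in silico model. The Python sketch below is offered only as an illustration under 
stated assumptions and is not the laboratory protocol: it tracks a single strand per fragment 
rather than complementary strands, treats an "area of identity" as an exact suffix-to-prefix 
match of at least `min_identity` bases, and uses hypothetical function names and default 
parameter values chosen for the example. 

```python
import random
from typing import List, Optional


def fragment(template: str, min_len: int, max_len: int, rng: random.Random) -> List[str]:
    """Cleave one template into consecutive fragments of random length."""
    pieces, pos = [], 0
    while pos < len(template):
        size = rng.randint(min_len, max_len)
        pieces.append(template[pos:pos + size])
        pos += size
    return pieces


def overlap(a: str, b: str, min_identity: int) -> int:
    """Longest suffix of a that equals a prefix of b, or 0 if shorter than min_identity."""
    for k in range(min(len(a), len(b)), max(min_identity, 1) - 1, -1):
        if a[-k:] == b[:k]:
            return k
    return 0


def shuffle_cycle(pool: List[str], min_identity: int, rng: random.Random) -> List[str]:
    """One denature/anneal/extend cycle: each strand is extended along one partner."""
    extended = []
    for i, strand in enumerate(pool):
        # Partners share an area of identity with the strand's 3' end and add new bases.
        partners = [p for j, p in enumerate(pool)
                    if j != i and 0 < overlap(strand, p, min_identity) < len(p)]
        if partners:
            partner = rng.choice(partners)
            k = overlap(strand, partner, min_identity)
            extended.append(strand + partner[k:])  # partner primes extension of the strand
        else:
            extended.append(strand)
    return extended


def dna_shuffle(templates: List[str], oligos: Optional[List[str]] = None, cycles: int = 10,
                min_len: int = 10, max_len: int = 30, min_identity: int = 8,
                seed: int = 0) -> List[str]:
    """Fragment the templates, add optional oligonucleotides, and run repeated cycles."""
    rng = random.Random(seed)
    pool = [piece for t in templates for piece in fragment(t, min_len, max_len, rng)]
    pool += list(oligos or [])
    for _ in range(cycles):
        pool = shuffle_cycle(pool, min_identity, rng)
    return pool
```

In this model, fragments derived from different template copies that were cleaved at different 
positions overlap one another, so repeated cycles yield progressively longer strands that can 
combine segments of different parents. Repetitive sequence can produce spurious overlaps, 
which the sketch does not attempt to exclude. 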

A plant stress-regulated regulatory element of the invention is useful for expressing a 
nucleotide sequence operatively linked to the element in a cell, particularly a plant cell. As 
used herein, the term "expression" refers to the transcription and/or translation of an 
endogenous gene or a transgene in plants. In the case of an antisense molecule, for example, 
the term "expression" refers to the transcription of the polynucleotide encoding the antisense 
molecule. 

As used herein, the term "operatively linked," when used in reference to a plant 
stress-regulated regulatory element, means that the regulatory element is positioned with 
respect to a second nucleotide sequence such that the regulatory element effects transcription 
or transcription and translation of the nucleotide sequence in substantially the same manner, 
but not necessarily to the same extent, as it does when the regulatory element is present in its 
natural position in a genome. Transcriptional promoters, for example, generally act in a 
position and orientation dependent manner and usually are positioned at or within about five 
nucleotides to about fifty nucleotides 5' (upstream) of the start site of transcription of a gene 
in nature. In comparison, enhancers and silencers can act in a relatively position or 
orientation independent manner and, therefore, can be positioned several hundred or thousand 
nucleotides upstream or downstream from a transcription start site, or in an intron within the 
coding region of a gene, yet still be operatively linked to a coding region so as to effect 
transcription. 

The second nucleotide sequence, i.e., the sequence operatively linked to the plant 
stress-regulated regulatory element, can be any nucleotide sequence, including, for example, 
a coding region of a gene or cDNA; a sequence encoding an antisense molecule, an RNAi 
molecule, ribozyme, triplexing agent (see, for example, Frank-Kamenetskii and Mirkin, Ann. 
Rev. Biochem. 64:65-95, 1995), or the like; or a sequence that, when transcribed, can be 
detected in the cell by, for example, hybridization or amplification, or when translated 
produces a detectable signal. The term "coding region" is used broadly herein to include a 
nucleotide sequence of a genomic DNA or a cDNA molecule comprising all or part of a 
coding region of the coding strand. A coding region can be transcribed from an operatively 
linked regulatory element, and can be translated into a full-length polypeptide or a peptide 
portion of a polypeptide, preferably a peptide portion having the same functional 
characteristics as the full-length polypeptide. It should be recognized that, in a nucleotide 
sequence comprising a coding region, not all of the nucleotides in the sequence need 
necessarily encode the polypeptide and, particularly, that a gene transcript can contain one or 
more introns, which do not encode an amino acid sequence of a polypeptide but, 
nevertheless, are part of the coding region, particularly the coding strand, of the gene. 

The present invention also relates to a recombinant polynucleotide, which contains a 
functional portion of a plant stress-regulated nucleotide sequence operatively linked to a 
heterologous nucleotide sequence. As used herein, the term "functional portion" of a plant 
stress-regulated sequence means a contiguous nucleotide sequence of the plant 
stress-regulated sequence that provides a function within a plant or plant cell. The portion can be 
any portion of the sequence, particularly a coding sequence, or a sequence encoding a peptide 
portion of the stress-regulated polypeptide; the stress-regulated regulatory element such as a 
promoter or minimal promoter; a sequence useful as an antisense molecule or triplexing 
agent; or a sequence useful for disrupting (knocking-out) an endogenous plant 
stress-regulated nucleotide sequence. 

A heterologous nucleotide sequence is a nucleotide sequence that is not normally part 
of the plant stress-regulated polynucleotide from which the functional portion of the plant 
stress-regulated polynucleotide component of the recombinant polynucleotide is obtained; or, 
if it is a part of the plant stress-regulated polynucleotide sequence from which the functional 
portion is obtained, it is in an orientation other than it would normally be in, for example, is an 
antisense sequence, or is at least partially discontinuous as compared to the genomic 
structure, for example, a single exon operatively linked to the regulatory element. In general, 
where the functional portion of the plant stress-regulated nucleotide sequence comprises the 
coding sequence in a recombinant polynucleotide of the invention, the heterologous 
nucleotide sequence will function as a regulatory element. The regulatory element can be 
any heterologous regulatory element, including, for example, a constitutively active 
regulatory element, an inducible regulatory element, or a tissue specific or phase specific 
regulatory element, as disclosed above. Conversely, where the functional portion of the plant 
stress-regulated polynucleotide comprises the stress-regulated regulatory element of a 
recombinant polynucleotide of the invention, the heterologous nucleotide sequence generally 
will be a nucleotide sequence that can be transcribed and, if desired, translated. Where the 
heterologous nucleotide sequence is expressed from a plant stress-regulated regulatory 
element, it generally confers a desirable phenotype to a plant cell containing the recombinant 
polynucleotide, or provides a means to identify a plant cell containing the recombinant 
polynucleotide. It should be recognized that a "desirable" phenotype can be one that 
decreases the ability of a plant cell to compete where the plant cell, or a plant containing the 
cell, is an undesired plant cell. Thus, a heterologous nucleotide sequence can allow a plant to 
grow, for example, under conditions in which it would not normally be able to grow. 

A heterologous nucleotide sequence can be, or encode, a selectable marker. As used 
herein, the term "selectable marker" refers to a molecule that, when present 
or expressed in a plant cell, provides a means to identify a plant cell containing the marker. 
As such, a selectable marker can provide a means for screening a population of plants, or 
plant cells, to identify those having the marker. A selectable marker also can confer a 
selective advantage to the plant cell, or a plant containing the cell. The selective advantage 
can be, for example, the ability to grow in the presence of a negative selective agent such as 
an antibiotic or herbicide, compared to the growth of plant cells that do not contain the 
selectable marker. The selective advantage also can be due, for example, to an enhanced or 
novel capacity to utilize an added compound as a nutrient, growth factor or energy source. A 
selective advantage can be conferred, for example, by a single polynucleotide, or its 
expression product, or by a combination of polynucleotides whose expression in a plant cell 
provides the cell with a positive selective advantage, a negative selective advantage, or both. 

Examples of selectable markers include those that confer antimetabolite resistance, 
for example, dihydrofolate reductase, which confers resistance to methotrexate (Reiss, Plant 
Physiol. (Life Sci. Adv.) 13:143-149, 1994); neomycin phosphotransferase, which confers 
resistance to the aminoglycosides neomycin, kanamycin and paromomycin (Herrera-Estrella, 
EMBO J. 2:987-995, 1983) and hygro, which confers resistance to hygromycin (Marsh, Gene 
32:481-485, 1984); trpB, which allows cells to utilize indole in place of tryptophan; hisD, 
which allows cells to utilize histidinol in place of histidine (Hartman, Proc. Natl. Acad. Sci. 
USA 85:8047, 1988); mannose-6-phosphate isomerase, which allows cells to utilize mannose 
(WO 94/20627); ornithine decarboxylase, which confers resistance to the ornithine 
decarboxylase inhibitor, 2-(difluoromethyl)-DL-ornithine (DFMO; McConlogue, 1987, In: 
Current Communications in Molecular Biology, Cold Spring Harbor Laboratory ed.); and 
deaminase from Aspergillus terreus, which confers resistance to Blasticidin S (Tamura, 
Biosci. Biotechnol. Biochem. 59:2336-2338, 1995). Additional selectable markers include 
those that confer herbicide resistance, for example, phosphinothricin acetyltransferase gene, 
which confers resistance to phosphinothricin (White et al., Nucl. Acids Res. 18:1062, 1990; 
Spencer et al., Theor. Appl. Genet. 79:625-631, 1990), a mutant EPSP-synthase, which 
confers glyphosate resistance (Hinchee et al., Bio/Technology 91:915-922, 1998), a mutant 
acetolactate synthase, which confers imidazolinone or sulfonylurea resistance (Lee et al., 
EMBO J. 7:1241-1248, 1988), a mutant psbA, which confers resistance to atrazine (Smeda et 
al., Plant Physiol. 103:911-917, 1993), or a mutant protoporphyrinogen oxidase (see U.S. 
Patent No. 5,767,373), or other markers conferring resistance to an herbicide such as 
glufosinate. In addition, markers that facilitate identification of a plant cell containing the 
polynucleotide encoding the marker include, for example, luciferase (Giacomin, Plant Sci. 
116:59-72, 1996; Scikantha, J. Bacteriol. 178:121, 1996), green fluorescent protein (Gerdes, 
FEBS Lett. 389:44-47, 1996) or β-glucuronidase (Jefferson, EMBO J. 6:3901-3907, 1997), 
and numerous others as disclosed herein or otherwise known in the art. Such markers also 
can be used as reporter molecules. 

A heterologous nucleotide sequence can encode an antisense molecule, particularly an
antisense molecule specific for a plant stress-regulated nucleotide sequence, for example, the
gene from which the regulatory component of the recombinant polynucleotide is derived.
Such a recombinant polynucleotide can be useful for reducing the expression of a plant
stress-regulated polypeptide in response to a stress condition because the antisense molecule,
like the polypeptide, will be induced only upon exposure to the stress. A heterologous
nucleotide sequence also can be, or can encode, a ribozyme or a triplexing agent. In addition
to being useful as heterologous nucleotide sequences, such molecules also can be used
directly in a method of the invention, for example, to modulate the responsiveness of a plant
cell to a stress condition. Thus, an antisense molecule, ribozyme, or triplexing agent can be
contacted directly with a target cell and, upon uptake by the cell, can effect its antisense,
ribozyme or triplexing activity; or can be encoded by a heterologous nucleotide sequence that
is expressed in a plant cell from a plant stress-regulated regulatory element, whereupon it can
effect its activity.

An antisense polynucleotide, ribozyme or triplexing agent is complementary to a
target sequence, which can be a DNA or RNA sequence, for example, messenger RNA, and
can be a coding sequence, a nucleotide sequence comprising an intron-exon junction, a
regulatory sequence, or the like. The degree of complementarity is such that the
polynucleotide, for example, an antisense polynucleotide, can interact specifically with the
target sequence in a cell. Depending on the total length of the antisense or other
polynucleotide, one or a few mismatches with respect to the target sequence can be tolerated
without losing the specificity of the polynucleotide for its target sequence. Thus, few if any
mismatches would be tolerated in an antisense molecule consisting, for example, of twenty
nucleotides, whereas several mismatches will not affect the hybridization efficiency of an
antisense molecule that is complementary, for example, to the full length of a target mRNA
encoding a cellular polypeptide. The number of mismatches that can be tolerated can be
estimated, for example, using well known formulas for determining hybridization kinetics
(see Sambrook et al., "Molecular Cloning: A Laboratory Manual" 2nd Edition (Cold Spring
Harbor Laboratory Press, Cold Spring Harbor, NY; 1989)) or can be determined empirically
using methods as disclosed herein or otherwise known in the art, particularly by determining
that the presence of the antisense polynucleotide, ribozyme, or triplexing agent in a cell
decreases the level of the target sequence or the expression of a polypeptide encoded by the
target sequence in the cell.
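
The notion of complementarity and mismatches described above can be illustrated with a
minimal sketch. It assumes strict Watson-Crick pairing on RNA (G-U wobble pairs are counted
as mismatches) and equal-length sequences; the function names and the example sequences are
hypothetical and are not taken from this disclosure. The sketch only counts mismatches; it
does not estimate hybridization kinetics.

```python
# Illustrative sketch only: strict Watson-Crick pairing, hypothetical names.
COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement(rna: str) -> str:
    """Return the perfectly complementary antisense strand, written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna.upper()))


def count_mismatches(antisense: str, target: str) -> int:
    """Count positions at which the antisense strand fails to pair with the target."""
    if len(antisense) != len(target):
        raise ValueError("antisense and target must have the same length")
    perfect = reverse_complement(target)
    return sum(1 for a, p in zip(antisense.upper(), perfect) if a != p)


def mismatch_fraction(antisense: str, target: str) -> float:
    """Fraction of positions that are mismatched (0.0 means fully complementary)."""
    return count_mismatches(antisense, target) / len(target)


if __name__ == "__main__":
    target = "AUGGCUAGCUUAGGCCAUGA"          # hypothetical 20-nt target region
    antisense = reverse_complement(target)   # fully complementary 20-mer
    print(antisense, count_mismatches(antisense, target))
```

A single mismatch in a twenty-nucleotide molecule is a far larger fraction of the duplex
than a single mismatch in a full-length antisense transcript, which is consistent with the
length dependence noted above.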

A nucleotide sequence useful as an antisense molecule, a ribozyme or a triplexing agent
can inhibit translation or cleave a polynucleotide encoded by a plant stress-regulated nucleotide
sequence, thereby modulating the responsiveness of a plant cell to a stress condition. An
antisense molecule, for example, can bind to an mRNA to form a double stranded molecule
that cannot be translated in a cell. Antisense oligonucleotides of at least about 15 to 25
nucleotides are typically used since they are easily synthesized and can hybridize specifically
with a target sequence, although longer antisense molecules can be expressed from a
recombinant polynucleotide introduced into the target cell. Specific nucleotide sequences
useful as antisense molecules can be identified using well-known methods, for example, gene
walking methods (see, for example, Seimiya et al., J. Biol. Chem. 272:4631-4636, 1997).
Where the antisense molecule is contacted directly with a target cell, it can be operatively
associated with a chemically reactive group such as iron-linked EDTA, which cleaves a target
RNA at the site of hybridization. A triplexing agent, in comparison, can stall transcription
(Maher et al., Antisense Res. Devel. 1:227, 1991; Helene, Anticancer Drug Design 6:569,
1991).
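
As a rough illustration of how candidate antisense oligonucleotides of about 15 to 25
nucleotides might be enumerated along a target mRNA, the following sketch tiles a
sliding window over the transcript and emits the complementary DNA oligonucleotide for
each window. The tiling scheme, the function name and the parameters are assumptions made
for illustration; the sketch does not reproduce the gene walking method of the cited
reference and does not rank candidates.

```python
from typing import Iterator, Tuple

# Illustrative sketch only: DNA oligos complementary to an RNA (or DNA) target.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "U": "A"}


def antisense_windows(mrna: str, length: int = 20, step: int = 1) -> Iterator[Tuple[int, str]]:
    """Yield (start, oligo) pairs, where oligo is the 5'->3' antisense DNA
    complementary to mrna[start:start + length]."""
    if not 15 <= length <= 25:
        raise ValueError("length is expected to be between 15 and 25 nucleotides")
    if step < 1:
        raise ValueError("step must be at least 1")
    seq = mrna.upper()
    for start in range(0, len(seq) - length + 1, step):
        window = seq[start:start + length]
        yield start, "".join(COMPLEMENT[base] for base in reversed(window))


if __name__ == "__main__":
    hypothetical_mrna = "AUG" * 10  # placeholder sequence, not from the disclosure
    for start, oligo in antisense_windows(hypothetical_mrna, length=15, step=5):
        print(start, oligo)
```

Each candidate would still need to be tested empirically, for example by determining
whether it decreases expression of the target in a cell.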

A plant stress-regulated regulatory element can be included in an expression cassette.
As used herein, the term "expression cassette" refers to a nucleotide sequence that can direct
expression of a particular polynucleotide in an appropriate host cell. Expression cassettes
typically comprise, as operably linked components, a promoter, a nucleotide sequence whose
expression is desired and a termination signal. Expression cassettes also often contain
sequences necessary for proper translation of the sequence to be expressed along with
selection and marker sequences. Thus, a plant stress-regulated regulatory element can
constitute an expression cassette, or component thereof. An expression cassette is
particularly useful for directing expression of a nucleotide sequence, which can be an
endogenous nucleotide sequence or a heterologous nucleotide sequence, in a cell, particularly
a plant cell. In general, an expression cassette can be introduced into a plant cell such that
the plant cell, a plant resulting from the plant cell, seeds obtained from such a plant, or plants
produced from such seeds are resistant to a stress condition.
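
The typical layout of an expression cassette described above (promoter, sequence to be
expressed, termination signal, with optional additional elements) can be sketched as a
simple data structure. The role labels, element names and the ordering check are
hypothetical conventions chosen for illustration; operable linkage is a functional
property, and the positional check below is only a crude stand-in for it.

```python
from dataclasses import dataclass
from typing import List

# Roles that a typical cassette is assumed to contain, in 5'->3' order.
REQUIRED_ORDER = ("promoter", "coding_sequence", "terminator")


@dataclass(frozen=True)
class Element:
    role: str   # e.g. "promoter", "leader", "transit_peptide", "terminator"
    name: str   # free-text label for the element


def has_typical_layout(elements: List[Element]) -> bool:
    """Return True if promoter, coding sequence and terminator each occur
    exactly once and appear in that 5'->3' order; other elements may be
    interspersed."""
    roles = [e.role for e in elements]
    positions = []
    for role in REQUIRED_ORDER:
        if roles.count(role) != 1:
            return False
        positions.append(roles.index(role))
    return positions == sorted(positions)


if __name__ == "__main__":
    cassette = [
        Element("promoter", "stress-regulated regulatory element"),
        Element("leader", "viral leader (hypothetical)"),
        Element("coding_sequence", "sequence of interest (hypothetical)"),
        Element("terminator", "termination signal (hypothetical)"),
    ]
    print(has_typical_layout(cassette))
```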

Additional regulatory sequences as disclosed herein or other desirable sequences such
as selectable markers or the like can be incorporated into an expression cassette containing a
plant stress-regulated regulatory element (see, for example, WO 99/47552). Examples of
suitable markers include dihydrofolate reductase (DHFR) or neomycin resistance for
eukaryotic cells and tetracycline or ampicillin resistance for E. coli. Selection markers in
plants include bleomycin, gentamycin, glyphosate, hygromycin, kanamycin, methotrexate,
phleomycin, phosphinothricin, spectinomycin, streptomycin, sulfonamide and sulfonylurea
resistance (Maliga et al., Methods in Plant Molecular Biology, Cold Spring Harbor
Laboratory Press, 1995, p. 39). The selection marker can have its own promoter or its
expression can be driven by the promoter operably linked to the sequence of interest.
Additional sequences such as intron sequences (e.g. from Adh1 or bronze1) or viral leader
sequences (e.g. from TMV, MCMV and AMV), all of which can enhance expression, can
be included in the cassette. In addition, where it is desirable to target expression of a
nucleotide sequence operatively linked to the stress-regulated regulatory element, a sequence
encoding a cellular localization motif can be included in the cassette, for example, such that
an encoded transcript or translation product is translocated to and localizes in the cytosol,
nucleus, a chloroplast, or another subcellular organelle. Examples of useful transit peptides
and transit peptide sequences can be found in Von Heijne et al., Plant Mol. Biol. Rep. 9:104,
1991; Clark et al., J. Biol. Chem. 264:17544, 1989; della-Cioppa et al., Plant Physiol. 84:965,
1987; Romer et al., Biochem. Biophys. Res. Commun. 196:1414, 1993; Shah et al.,
Science 233:478, 1986; Archer et al., J. Bioenerg. Biomembr. 22:789, 1990; Scandalios,
Prog. Clin. Biol. Res. 344:515, 1990; Weisbeek et al., J. Cell Sci. Suppl. 11:199, 1989; and
Bruce, Trends Cell Biol. 10:440, 2000. The present invention can utilize native or
heterologous transit peptides. The encoding sequence for a transit peptide can include all or a
portion of the encoding sequence for a particular transit peptide, and may also contain
portions of the mature protein encoding sequence associated with a particular transit peptide.

A functional portion of a plant stress-regulated polynucleotide, or an expression
cassette, can be introduced into a cell as a naked DNA molecule, can be incorporated in a
matrix such as a liposome or a particle such as a viral particle, or can be incorporated into a
vector. Such vectors can be cloning or expression vectors, but other uses are within the scope
of the present invention. A cloning vector is a self-replicating DNA molecule that serves to
transfer a DNA segment into a host cell. The three most common types of cloning vectors
are bacterial plasmids, phages, and other viruses. An expression vector is a cloning vector
designed so that a coding sequence inserted at a particular site will be transcribed and
translated into a protein.

Incorporation of the polynucleotide into a vector can facilitate manipulation of the
polynucleotide, or introduction of the polynucleotide into a plant cell. A vector can be
derived from a plasmid or a viral vector such as a T-DNA vector (Horsch et al., Science
227:1229-1231, 1985). If desired, the vector can comprise components of a plant
transposable element, for example, a Ds transposon (Bancroft and Dean, Genetics
134:1221-1229, 1993) or an Spm transposon (Aarts et al., Mol. Gen. Genet. 247:555-564, 1995).

In addition to containing the polynucleotide portion of a plant stress-regulated
polynucleotide, a vector can contain various nucleotide sequences that facilitate, for example,
rescue of the vector from a transformed plant cell; passage of the vector in a host cell, which
can be a plant, animal, bacterial, or insect host cell; or expression of an encoding nucleotide
sequence in the vector, including all or a portion of a rescued coding region. As such, the
vector can contain any of a number of additional transcription and translation elements,
including constitutive and inducible promoters, enhancers, and the like (see, for example,
Bitter et al., Meth. Enzymol. 153:516-544, 1987). For example, a vector can contain
elements useful for passage, growth or expression in a bacterial system, including a bacterial
origin of replication; a promoter, which can be an inducible promoter; and the like. In
comparison, a vector that can be passaged in a mammalian host cell system can have a
promoter such as a metallothionein promoter, which has characteristics of both a constitutive
promoter and an inducible promoter, or a viral promoter such as a retrovirus long terminal
repeat, an adenovirus late promoter, or the like. A vector also can contain one or more
restriction endonuclease recognition and cleavage sites, including, for example, a polylinker
sequence, to facilitate rescue of a nucleotide sequence operably linked to the polynucleotide
portion.

The present invention also relates to a method of using a polynucleotide portion of a
plant stress-regulated nucleotide sequence to confer a selective advantage on a plant cell.
Such a method can be performed by introducing, for example, a plant stress-regulated
regulatory element into a plant cell, wherein, upon exposure of the plant cell to a stress
condition to which the regulatory element is responsive, a nucleotide sequence operatively
linked to the regulatory element is expressed, thereby conferring a selective advantage on the
plant cell. The operatively linked nucleotide sequence can be a heterologous nucleotide
sequence, which can be operatively linked to the regulatory element prior to introduction of
the regulatory sequence into the plant cell; or can be an endogenous nucleotide sequence into
which the regulatory element was targeted by a method such as homologous recombination.
The selective advantage conferred by the operatively linked nucleotide sequence can be such
that the plant is better able to tolerate the stress condition; or can be any other selective
advantage.

As used herein, the term "selective advantage" refers to the ability of a particular
organism to better propagate, develop, grow, survive, or otherwise tolerate a condition as
compared to a corresponding reference organism that does not contain a plant
stress-regulated polynucleotide of the present invention. In one embodiment, a selective advantage
is exemplified by the ability of a desired plant, plant cell, or the like, that contains an
introduced plant stress-regulated regulatory element, to grow better than an undesired plant,
plant cell, or the like, that does not contain the introduced regulatory element. For example, a
recombinant polynucleotide comprising a plant stress-regulated regulatory element
operatively linked to a heterologous nucleotide sequence encoding an enzyme that inactivates
a herbicide can be introduced in a desired plant. Upon exposure of a mixed population of
plants comprising the desired plants, which contain the recombinant polynucleotide, and one
or more other populations of undesired plants, which lack the recombinant polynucleotide, to
a stress condition that induces expression from the regulatory element and to the herbicide, the
desired plants will have a greater likelihood of surviving exposure to the herbicide and, therefore,
a selective advantage over the undesired plants.

In another embodiment, a selective advantage is exemplified by the ability of a
desired plant, plant cell, or the like, to better propagate, develop, grow, survive, or otherwise
tolerate a condition as compared to an undesired plant, plant cell, or the like, that contains an
introduced plant stress-regulated regulatory element. For example, a recombinant
polynucleotide comprising a plant stress-regulated regulatory element operatively linked to a
plant cell toxin can be introduced into cells of an undesirable plant present in a mixed
population of desired and undesired plants, for example, food crops and weeds, respectively;
the plants can then be exposed to stress conditions that induce expression from the plant
stress-regulated regulatory element, whereby expression of the plant cell toxin results in
inhibition of growth or death of the undesired plants, thereby providing a selective advantage
to the desired plants, which no longer have to compete with the undesired plants for nutrients,
light, or the like. In another example, a plant stress-regulated regulatory element operatively
linked to a plant cell toxin can be introduced into cells of plants used as a nurse crop. Nurse
crops, also called cover or companion crops, are planted in combination with plants of
interest to provide, among other things, shade and soil stability during establishment of the
desired plants. Once the desired plants have become established, the presence of the nurse
crop may no longer be desirable. Exposure to conditions inducing expression of the gene
linked to the plant stress-regulated regulatory element allows elimination of the nurse crop.
Alternatively, nurse crops can be made less tolerant to abiotic stress by the inhibition of any
of the stress-regulated sequences disclosed herein. Inhibition can be accomplished by any of
the methods described herein. Upon exposure of the nurse crop to the stress, the decreased
ability of the nurse crop to respond to the stress will result in elimination of the nurse crop,
leaving only the desired plants.

The invention also provides a means of producing a transgenic plant, which comprises
plant cells that exhibit altered responsiveness to a stress condition. As such, the present
invention further provides a transgenic plant, or plant cells or tissues derived therefrom,
which are genetically modified to respond to stress differently than a corresponding wild-type
plant or plant not containing constructs of the present invention would respond. As used
herein, the term "responsiveness to a stress condition" refers to the ability of a plant to
express a plant stress-regulated polynucleotide upon exposure to the stress condition. A
transgenic plant cell contains a functional portion of a plant stress-regulated polynucleotide,
or a mutant form thereof, for example, a knock-out mutant. A knock-out mutant form of a
plant stress-regulated nucleotide sequence can contain, for example, a mutation such that a
STOP codon is introduced into the reading frame of the translated portion of the gene such



WO 03/008540 PCT/US02/19668 


that expression of a functional stress-regulated polypeptide is prevented; or a mutation in the
stress-regulated regulatory element such that inducibility of the element in response to a
stress condition is inhibited. Such transgenic plants of the invention can display any of
various phenotypic modifications in response to an abiotic stress, including altered tolerance to
the stress condition, as well as increased or decreased plant growth, root growth, yield, or the
like, as compared to the corresponding wild-type plant.
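
The first kind of knock-out mutation mentioned above, a STOP codon introduced into the
reading frame, can be checked computationally with a short sketch. It assumes a DNA coding
sequence beginning at the first base of the start codon, the three standard stop codons,
and no frameshifts; the function names and the example sequences are hypothetical.

```python
from typing import Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def first_stop_index(cds: str) -> Optional[int]:
    """Return the codon index (0-based) of the first in-frame stop codon,
    or None if the reading frame contains no stop codon."""
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


def has_premature_stop(wild_type: str, mutant: str) -> bool:
    """True if the mutant reading frame terminates earlier than the wild type."""
    wt_stop = first_stop_index(wild_type)
    mut_stop = first_stop_index(mutant)
    if mut_stop is None:
        return False
    return wt_stop is None or mut_stop < wt_stop


if __name__ == "__main__":
    wild_type = "ATGGCTTGGAAATAA"   # hypothetical: Met-Ala-Trp-Lys-stop
    mutant = "ATGGCTTGAAAATAA"      # TGG -> TGA introduces an early stop
    print(has_premature_stop(wild_type, mutant))
```

A truncated product is not necessarily non-functional, so a premature stop found this way
would still need to be confirmed as a loss of function experimentally.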

The term "plant" is used broadly herein to include any plant at any stage of
development, or any part of a plant, including a plant cutting, a plant cell, a plant cell culture, a
plant organ, a plant seed, and a plantlet. A plant cell is the structural and physiological unit
of the plant, comprising a protoplast and a cell wall. A plant cell can be in the form of an
isolated single cell or a cultured cell, or can be part of a higher organized unit, for example, a
plant tissue, plant organ, or plant. Thus, a plant cell can be a protoplast, a gamete producing
cell, or a cell or collection of cells that can regenerate into a whole plant. As such, a seed,
which comprises multiple plant cells and is capable of regenerating into a whole plant, is
considered a plant cell for purposes of this disclosure. A plant tissue or plant organ can be a
seed, protoplast, callus, or any other group of plant cells that is organized into a structural or
functional unit. Particularly useful parts of a plant include harvestable parts and parts useful
for propagation of progeny plants. A harvestable part of a plant can be any useful part of a
plant, for example, flowers, pollen, seedlings, tubers, leaves, stems, fruit, seeds, roots, and the
like. A part of a plant useful for propagation includes, for example, seeds, fruits, cuttings,
seedlings, tubers, rootstocks, and the like.

A transgenic plant can be regenerated from a transformed plant cell. As used herein,
the term "regenerate" means growing a whole plant from a plant cell; a group of plant cells; a
protoplast; a seed; or a piece of a plant such as a callus or tissue. Regeneration from
protoplasts varies from species to species of plants. For example, a suspension of protoplasts
can be made and, in certain species, embryo formation can be induced from the protoplast
suspension, to the stage of ripening and germination. The culture medium generally contains
various components necessary for growth and regeneration, including, for example,
hormones such as auxins and cytokinins; and amino acids such as glutamic acid and proline,
depending on the particular plant species. Efficient regeneration will depend, in part, on the
medium, the genotype, and the history of the culture. If these variables are controlled,
however, regeneration is reproducible.

Regeneration can occur from plant callus, explants, organs or plant parts.
Transformation can be performed in the context of organ or plant part regeneration (see
Meth. Enzymol. Vol. 118; Klee et al., Ann. Rev. Plant Physiol. 38:467, 1987). Utilizing the
leaf disk-transformation-regeneration method, for example, disks are cultured on selective
media, followed by shoot formation in about two to four weeks (see Horsch et al., Science
227:1229, 1985). Shoots that develop are excised from calli and transplanted to appropriate
root-inducing selective medium. Rooted plantlets are transplanted to soil as soon as possible
after roots appear. The plantlets can be repotted as required, until reaching maturity.

In vegetatively propagated crops, the mature transgenic plants are propagated utilizing
cuttings or tissue culture techniques to produce multiple identical plants. Selection of
desirable transgenotes is made and new varieties are obtained and propagated vegetatively for
commercial use. In seed propagated crops, the mature transgenic plants can be self crossed to
produce a homozygous inbred plant. The resulting inbred plant produces seeds that contain
the introduced plant stress-induced regulatory element, and can be grown to produce plants
that express a polynucleotide or polypeptide in response to a stress condition that induces
expression from the regulatory element. As such, the invention further provides seeds
produced by a transgenic plant obtained by a method of the invention.
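
The expected outcome of self crossing can be sketched under a simplifying assumption that
is not stated in this disclosure: the transgene is inserted at a single locus and segregates
in a Mendelian fashion. Under that assumption, a primary transformant carrying one copy
yields progeny with zero, one or two copies in a 1:2:1 ratio, and the two-copy
(homozygous) progeny breed true on further selfing. The function name is hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product
from typing import Dict


def self_cross(transgene_copies: int) -> Dict[int, Fraction]:
    """Return expected progeny fractions, keyed by transgene copy number,
    when a diploid plant with 0, 1 or 2 copies at one locus is selfed."""
    if transgene_copies not in (0, 1, 2):
        raise ValueError("a diploid single-locus parent carries 0, 1 or 2 copies")
    # Each gamete carries one of the parent's two alleles (1 = transgene present).
    gametes = [1] * transgene_copies + [0] * (2 - transgene_copies)
    counts = Counter(a + b for a, b in product(gametes, repeat=2))
    return {copies: Fraction(n, 4) for copies, n in sorted(counts.items())}


if __name__ == "__main__":
    print(self_cross(1))  # progeny of a single-copy primary transformant
    print(self_cross(2))  # progeny of a homozygous inbred plant
```

Multiple insertion sites, linkage or gene silencing would change these expectations, so
the ratios serve only as a baseline against which observed segregation can be compared.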

In addition, transgenic plants comprising different recombinant sequences can be
crossbred, thereby providing a means to obtain transgenic plants containing two or more
different transgenes, each of which contributes a desirable characteristic to the plant.
Methods for breeding plants and selecting for crossbred plants having desirable
characteristics or other characteristics of interest are well known in the art.

A method of the invention can be performed by introducing a functional portion of a
plant stress-regulated nucleotide sequence into the plant. As used herein, the term
"introducing" means transferring a polynucleotide into a plant cell. A polynucleotide can be
introduced into a cell by a variety of methods well known to those of ordinary skill in the art.
For example, the polynucleotide can be introduced into a plant cell using a direct gene
transfer method such as electroporation or microprojectile mediated transformation, or using
Agrobacterium mediated transformation. Non-limiting examples of methods for the
introduction of polynucleotides into plants are provided in greater detail herein. As used
herein, the term "transformed" refers to a plant cell containing an exogenously introduced
polynucleotide portion of a plant stress-regulated nucleotide sequence that is or can be
rendered active in a plant cell, or to a plant comprising a plant cell containing such a
polynucleotide.

It should be recognized that one or more polynucleotides, which are the same or
different, can be introduced into a plant, thereby providing a means to obtain a genetically
modified plant containing multiple copies of a single transgenic sequence, or containing two
or more different transgenic sequences, either or both of which can be present in multiple
copies. Such transgenic plants can be produced, for example, by simply selecting plants
having multiple copies of a single type of transgenic sequence; by co-transfecting plant cells
with two or more populations of different transgenic sequences and identifying those
containing the two or more different transgenic sequences; or by crossbreeding transgenic
plants, each of which contains one or more desired transgenic sequences, and identifying
those progeny having the desired sequences.

Methods for introducing a polynucleotide into a plant cell to obtain a transformed
plant also include direct gene transfer (see European Patent A 164 575), injection,
electroporation, biolistic methods such as particle bombardment, pollen-mediated
transformation, plant RNA virus-mediated transformation, liposome-mediated
transformation, transformation using wounded or enzyme-degraded immature embryos, or
wounded or enzyme-degraded embryogenic callus, and the like. Transformation methods
using Agrobacterium tumefaciens tumor inducing (Ti) plasmids or root-inducing (Ri)
plasmids, or plant virus vectors are well known in the art (see, for example, WO 99/47552;
Weissbach & Weissbach, "Methods for Plant Molecular Biology" (Academic Press, 1988),
section VIII, pages 421-463; Grierson and Corey, "Plant Molecular Biology" 2d Ed. (Blackie,
London, 1988), Chapters 7-9; Horsch et al., Science 227:1229, 1985). The wild-type form of
Agrobacterium, for example, contains a Ti plasmid, which directs production of tumorigenic
crown gall growth on host plants. Transfer of the tumor inducing T-DNA region of the Ti
plasmid to a plant genome requires the Ti plasmid-encoded virulence genes as well as T-DNA
borders, which are a set of direct DNA repeats that delineate the region to be transferred. An
Agrobacterium based vector is a modified form of a Ti plasmid, in which the tumor inducing
functions are replaced by a nucleotide sequence of interest that is to be introduced into the
plant host.

Methods of using Agrobacterium mediated transformation include cocultivation of
Agrobacterium with cultured isolated protoplasts; transformation of plant cells or tissues with
Agrobacterium; and transformation of seeds, apices or meristems with Agrobacterium. In
addition, in planta transformation by Agrobacterium can be performed using vacuum
infiltration of a suspension of Agrobacterium cells (Bechtold et al., C.R. Acad. Sci. Paris
316:1194, 1993).

Agrobacterium mediated transformation can employ cointegrate vectors or binary
vector systems, in which the components of the Ti plasmid are divided between a helper
vector, which resides permanently in the Agrobacterium host and carries the virulence genes,
and a shuttle vector, which contains the gene of interest bounded by T-DNA sequences.
Binary vectors are well known in the art (see, for example, DeFramond, BioTechnology
1:262, 1983; Hoekema et al., Nature 303:179, 1983) and are commercially available
(Clontech; Palo Alto CA). For transformation, Agrobacterium can be cocultured, for
example, with plant cells or wounded tissue such as leaf tissue, root explants,
hypocotyledons, stem pieces or tubers (see, for example, Glick and Thompson, "Methods in
Plant Molecular Biology and Biotechnology", Boca Raton FL, CRC Press, 1993). Wounded
cells within the plant tissue that have been infected by Agrobacterium can develop organs de
novo when cultured under the appropriate conditions; the resulting transgenic shoots
eventually give rise to transgenic plants, which contain an exogenous polynucleotide portion
of a plant stress-regulated nucleotide sequence.

Agrobacterium mediated transformation has been used to produce a variety of
transgenic plants, including, for example, transgenic cruciferous plants such as Arabidopsis,
mustard, rapeseed and flax; transgenic leguminous plants such as alfalfa, pea, soybean, trefoil
and white clover; and transgenic solanaceous plants such as eggplant, petunia, potato,
tobacco and tomato (see, for example, Wang et al., "Transformation of Plants and Soil
Microorganisms", Cambridge University Press, 1995). In addition, Agrobacterium mediated
transformation can be used to introduce an exogenous polynucleotide sequence, for example,
a plant stress-regulated regulatory element, into apple, aspen, belladonna, black currant,
carrot, celery, cotton, cucumber, grape, horseradish, lettuce, morning glory, muskmelon,
neem, poplar, strawberry, sugar beet, sunflower, walnut, asparagus, rice and other plants (see,
for example, Glick and Thompson, supra, 1993; Hiei et al., Plant J. 6:271-282, 1994;
Shimamoto, Science 270:1772-1773, 1995).

Suitable strains of Agrobacterium tumefaciens and vectors as well as transformation
of Agrobacteria and appropriate growth and selection media are well known in the art
(GV3101 (pMK90RK), Koncz, Mol. Gen. Genet. 204:383-396, 1986; C58C1
(pGV3850kan), Deblaere, Nucl. Acid Res. 13:4777, 1985; Bevan, Nucleic Acid Res. 12:8711,
1984; Koncz, Proc. Natl. Acad. Sci. USA 86:8467-8471, 1986; Koncz, Plant Mol. Biol.
20:963-976, 1992; Koncz, "Specialized vectors for gene tagging and expression studies", in:
Plant Molecular Biology Manual Vol. 2, Gelvin and Schilperoort (Eds.), Dordrecht, The
Netherlands: Kluwer Academic Publ. (1994), 1-22; European Patent A-120 516; Hoekema,
"The Binary Plant Vector System", Offsetdrukkerij Kanters B. V., Alblasserdam (1985),
Chapter V; Fraley, Crit. Rev. Plant Sci. 4:1-46; An, EMBO J. 4:277-287, 1985).

Where a polynucleotide portion of a plant stress-regulated nucleotide sequence is
contained in a vector, the vector can contain functional elements, for example "left border" and
"right border" sequences of the T-DNA of Agrobacterium, which allow for stable integration
into a plant genome. Furthermore, methods and vectors that permit the generation of
marker-free transgenic plants, for example, where a selectable marker gene is lost at a certain
stage of plant development or plant breeding, are known, and include, for example, methods
of co-transformation (Lyznik, Plant Mol. Biol. 13:151-161, 1989; Peng, Plant Mol. Biol.
27:91-104, 1995), or methods that utilize enzymes capable of promoting homologous
recombination in plants (see, e.g., WO 97/08331; Bayley, Plant Mol. Biol. 18:353-361, 1992;
Lloyd, Mol. Gen. Genet. 242:653-657, 1994; Maeser, Mol. Gen. Genet. 230:170-176, 1991;
Onouchi, Nucl. Acids Res. 19:6373-6378, 1991; see, also, Sambrook et al., supra, 1989).

A direct gene transfer method such as electroporation also can be used to introduce a
polynucleotide portion of a plant stress-regulated nucleotide sequence into a cell such as a
plant cell. For example, plant protoplasts can be electroporated in the presence of the
regulatory element, which can be in a vector (Fromm et al., Proc. Natl. Acad. Sci. USA
82:5824, 1985). Electrical impulses of high field strength reversibly permeabilize
membranes, allowing the introduction of the nucleic acid. Electroporated plant protoplasts
reform the cell wall, divide and form a plant callus. Microinjection can be performed as
described in Potrykus and Spangenberg (eds.), Gene Transfer To Plants, Springer Verlag,
Berlin, NY, 1995. A transformed plant cell containing the introduced polynucleotide can be
identified by detecting a phenotype due to the introduced polynucleotide, for example,
increased or decreased tolerance to a stress condition.

Microprojectile mediated transformation also can be used to introduce a
polynucleotide into a plant cell (Klein et al., Nature 327:70-73, 1987). This method utilizes
microprojectiles such as gold or tungsten, which are coated with the desired nucleic acid
molecule by precipitation with calcium chloride, spermidine or polyethylene glycol. The
microprojectile particles are accelerated at high speed into a plant tissue using a device such
as the BIOLISTIC PD-1000 (Biorad; Hercules CA).

Microprojectile mediated delivery ("particle bombardment") is especially useful to
transform plant cells that are difficult to transform or regenerate using other methods.
Methods for the transformation using biolistic methods are well known (Wan, Plant Physiol.
104:37-48, 1994; Vasil, Bio/Technology 11:1553-1558, 1993; Christou, Trends in Plant
Science 1:423-431, 1996). Microprojectile mediated transformation has been used, for
example, to generate a variety of transgenic plant species, including cotton, tobacco, corn,
hybrid poplar and papaya (see Glick and Thompson, supra, 1993). Important cereal crops
such as wheat, oat, barley, sorghum and rice also have been transformed using
microprojectile mediated delivery (Duan et al., Nature Biotech. 14:494-498, 1996;
Shimamoto, Curr. Opin. Biotech. 5:158-162, 1994). A rapid transformation regeneration
system for the production of transgenic plants, such as a system that produces transgenic
wheat in two to three months (see European Patent No. EP 0709462A2), also can be useful for
producing a transgenic plant using a method of the invention, thus allowing more rapid
identification of gene functions. The transformation of most dicotyledonous plants is
possible with the methods described above. Monocotyledonous plants also
can be transformed using, for example, biolistic methods as described above, protoplast
transformation, electroporation of partially permeabilized cells, introduction of DNA using
glass fibers, Agrobacterium mediated transformation, and the like.

Plastid transformation also can be used to introduce a polynucleotide portion of a
plant stress-regulated nucleotide sequence into a plant cell (U.S. Patent Nos. 5,451,513,
5,545,817, and 5,545,818; WO 95/16783; McBride et al., Proc. Natl. Acad. Sci. USA
91:7301-7305, 1994). Chloroplast transformation involves introducing regions of cloned
plastid DNA flanking a desired nucleotide sequence, for example, a selectable marker
together with a polynucleotide of interest, into a suitable target tissue, using, for example, a
biolistic or protoplast transformation method (e.g., calcium chloride or PEG mediated
transformation). One to 1.5 kb flanking regions ("targeting sequences") facilitate
homologous recombination with the plastid genome, and allow the replacement or
modification of specific regions of the plastome. Using this method, point mutations in the
chloroplast 16S rRNA and rps12 genes, which confer resistance to spectinomycin and
streptomycin, can be utilized as selectable markers for transformation (Svab et al., Proc. Natl.
Acad. Sci. USA 87:8526-8530, 1990; Staub and Maliga, Plant Cell 4:39-45, 1992); this
approach resulted in stable homoplasmic transformants at a frequency of approximately one
per 100 bombardments of target leaves. The presence of cloning sites between these markers
allowed creation of a plastid targeting vector for introduction of foreign genes (Staub and
Maliga, EMBO J. 12:601-606, 1993). Substantial increases in transformation frequency are
obtained by replacement of the recessive rRNA or r-protein antibiotic resistance genes with a
dominant selectable marker, the bacterial aadA gene encoding the spectinomycin-detoxifying
enzyme aminoglycoside-3'-adenyltransferase (Svab and Maliga, Proc. Natl.
Acad. Sci. USA 90:913-917, 1993). Approximately 15 to 20 cell division cycles following
transformation are generally required to reach a homoplastidic state. Plastid expression, in
which genes are inserted by homologous recombination into all of the several thousand
copies of the circular plastid genome present in each plant cell, takes advantage of the
enormous copy number advantage over nuclear-expressed genes to permit expression levels
that can readily exceed 10% of the total soluble plant protein.

Plants suitable for treatment according to a method of the invention can be monocots
or dicots and include, but are not limited to, corn (Zea mays), Brassica sp. (e.g., B. napus, B.
rapa, B. juncea), particularly those Brassica species useful as sources of seed oil, alfalfa
(Medicago sativa), rice (Oryza sativa), rye (Secale cereale), sorghum (Sorghum bicolor,
Sorghum vulgare), millet (e.g., pearl millet (Pennisetum glaucum), proso millet (Panicum
miliaceum), foxtail millet (Setaria italica), finger millet (Eleusine coracana)), sunflower
(Helianthus annuus), safflower (Carthamus tinctorius), wheat (Triticum aestivum), soybean
(Glycine max), tobacco (Nicotiana tabacum), potato (Solanum tuberosum), peanuts (Arachis
hypogaea), cotton (Gossypium barbadense, Gossypium hirsutum), sweet potato (Ipomoea
batatas), cassava (Manihot esculenta), coffee (Coffea spp.), coconut (Cocos nucifera),
pineapple (Ananas comosus), citrus trees (Citrus spp.), cocoa (Theobroma cacao), tea
(Camellia sinensis), banana (Musa spp.), avocado (Persea americana), fig (Ficus carica), guava
(Psidium guajava), mango (Mangifera indica), olive (Olea europaea), papaya (Carica
papaya), cashew (Anacardium occidentale), macadamia (Macadamia integrifolia), almond
(Prunus amygdalus), sugar beets (Beta vulgaris), sugarcane (Saccharum spp.), oats,
duckweed (Lemna), barley, tomatoes (Lycopersicon esculentum), lettuce (e.g., Lactuca
sativa), green beans (Phaseolus vulgaris), lima beans (Phaseolus limensis), peas (Lathyrus
spp.), and members of the genus Cucumis such as cucumber (C. sativus), cantaloupe (C.
cantalupensis), and musk melon (C. melo).

Ornamentals such as azalea (Rhododendron spp.), hydrangea (Macrophylla
hydrangea), hibiscus (Hibiscus rosa-sinensis), roses (Rosa spp.), tulips (Tulipa spp.),
daffodils (Narcissus spp.), petunias (Petunia hybrida), carnation (Dianthus caryophyllus),
poinsettia (Euphorbia pulcherrima), and chrysanthemum are also included. Additional
ornamentals within the scope of the invention include impatiens, Begonia, Pelargonium,
Viola, Cyclamen, Verbena, Vinca, Tagetes, Primula, Saintpaulia, Ageratum, Amaranthus,
Antirrhinum, Aquilegia, Cineraria, Clover, Cosmo, Cowpea, Dahlia, Datura, Delphinium,
Gerbera, Gladiolus, Gloxinia, Hippeastrum, Mesembryanthemum, Salpiglossis, and Zinnia.

Conifers that may be employed in practicing the present invention include, for
example, pines such as loblolly pine (Pinus taeda), slash pine (Pinus elliotii), ponderosa pine
(Pinus ponderosa), lodgepole pine (Pinus contorta), and Monterey pine (Pinus radiata);
Douglas-fir (Pseudotsuga menziesii); Western hemlock (Tsuga heterophylla); Sitka spruce (Picea
glauca); redwood (Sequoia sempervirens); true firs such as silver fir (Abies amabilis) and
balsam fir (Abies balsamea); and cedars such as Western red cedar (Thuja plicata) and
Alaska yellow-cedar (Chamaecyparis nootkatensis).

Leguminous plants which may be used in the practice of the present invention include
beans and peas. Beans include guar, locust bean, fenugreek, soybean, garden beans, cowpea,
mungbean, lima bean, fava bean, lentils, chickpea, etc. Legumes include, but are not limited
to, Arachis, e.g., peanuts, Vicia, e.g., crown vetch, hairy vetch, adzuki bean, mung bean, and
chickpea, Lupinus, e.g., lupine, Trifolium, Phaseolus, e.g., common bean and lima bean,
Pisum, e.g., field bean, Melilotus, e.g., clover, Medicago, e.g., alfalfa, Lotus, e.g., trefoil,
Lens, e.g., lentil, and false indigo. Forage and turf grass for use in the methods of the
invention include alfalfa, orchard grass, tall fescue, perennial ryegrass, creeping bent grass,
and redtop.

Other plants within the scope of the invention include Acacia, aneth, artichoke, arugula,
blackberry, canola, cilantro, Clementines, escarole, eucalyptus, fennel, grapefruit, honey dew,
jicama, kiwifruit, lemon, lime, mushroom, nut, okra, orange, parsley, persimmon, plantain,
pomegranate, poplar, radiata pine, radicchio, Southern pine, sweetgum, tangerine, triticale,
vine, yams, apple, pear, quince, cherry, apricot, melon, hemp, buckwheat, grape, raspberry,
chenopodium, blueberry, nectarine, peach, plum, strawberry, watermelon, eggplant, pepper,
cauliflower, Brassica, e.g., broccoli, cabbage, Brussels sprouts, onion, carrot, leek, beet, broad
bean, celery, radish, pumpkin, endive, gourd, garlic, snapbean, spinach, squash, turnip,
ultilane, chicory, groundnut and zucchini.

Angiosperms are divided into two broad classes based on the number of cotyledons,
which are seed leaves that generally store or absorb food; a monocotyledonous angiosperm
has a single cotyledon, and a dicotyledonous angiosperm has two cotyledons. Angiosperms
produce a variety of useful products including materials such as lumber, rubber, and paper;
fibers such as cotton and linen; herbs and medicines such as quinine and vinblastine;
ornamental flowers such as roses and orchids; and foodstuffs such as grains, oils, fruits and
vegetables.

Angiosperms encompass a variety of flowering plants, including, for example, cereal
plants, leguminous plants, oilseed plants, hardwood trees, fruit-bearing plants and ornamental
flowers, which general classes are not necessarily exclusive. Cereal plants, which produce an
edible grain cereal and are suitable for use in the present invention, include, for example,
corn, rice, wheat, barley, oat, rye, millet, orchardgrass, guinea grass, sorghum and turfgrass.
Leguminous plants include members of the pea family (Fabaceae) and produce a
characteristic fruit known as a legume. Examples of leguminous plants include, for example,
soybean, pea, chickpea, moth bean, broad bean, kidney bean, lima bean, lentil, cowpea, dry
bean, and peanut, as well as alfalfa, birdsfoot trefoil, clover and sainfoin. Oilseed plants,
which have seeds that are useful as a source of oil, include soybean, sunflower, rapeseed
(canola) and cottonseed.

Angiosperms also include hardwood trees, which are perennial woody plants that
generally have a single stem (trunk). Examples of such trees include alder, ash, aspen,
basswood (linden), beech, birch, cherry, cottonwood, elm, eucalyptus, hickory, locust, maple,
oak, persimmon, poplar, sycamore, walnut, sequoia, and willow. Trees are useful, for
example, as a source of pulp, paper, structural material and fuel.

Angiosperms are fruit-bearing plants that produce a mature, ripened ovary, which
generally contains seeds. A fruit can be suitable for human or animal consumption or for
collection of seeds to propagate the species. For example, hops are a member of the
mulberry family that are prized for their flavoring in malt liquor. Fruit-bearing angiosperms
also include grape, orange, lemon, grapefruit, avocado, date, peach, cherry, olive, plum,
coconut, apple and pear trees and blackberry, blueberry, raspberry, strawberry, pineapple,
tomato, cucumber and eggplant plants. An ornamental flower is an angiosperm cultivated for
its decorative flower. Examples of commercially important ornamental flowers include rose,
orchid, lily, tulip, chrysanthemum, snapdragon, camellia, carnation and petunia plants.
The skilled artisan will recognize that the methods of the invention can be practiced using
these or other angiosperms, as desired, as well as gymnosperms, which do not produce seeds
in a fruit.

A method of producing a transgenic plant can be performed by introducing a functional
portion of a plant stress-regulated polynucleotide into a plant cell genome, whereby the
functional portion of the plant stress-regulated polynucleotide modulates a response of the
plant cell to a stress condition, thereby producing a transgenic plant, which comprises plant
cells that exhibit altered responsiveness to the stress condition. In one embodiment, the
functional portion of the plant stress-regulated polynucleotide encodes a stress-regulated
polypeptide or functional peptide portion thereof, wherein expression of the stress-regulated
polypeptide or functional peptide portion thereof either increases the stress tolerance of the
transgenic plant, or decreases the stress tolerance of the transgenic plant. The functional
portion of the plant stress-regulated nucleotide sequence encoding the stress-regulated
polypeptide or functional peptide portion thereof can be operatively linked to a heterologous
promoter.

In another embodiment, the polynucleotide portion of the plant stress-regulated
nucleotide sequence comprises a stress-regulated regulatory element. The stress-regulated
regulatory element can integrate into the plant cell genome in a site-specific manner,
whereupon it can be operatively linked to an endogenous nucleotide sequence, which can be
expressed in response to a stress condition specific for the regulatory element; or can be a
mutant regulatory element, which is not responsive to the stress condition, whereby upon
integrating into the plant cell genome, the mutant regulatory element disrupts an endogenous
stress-regulated regulatory element of a plant stress-regulated nucleotide sequence, thereby
altering the responsiveness of the plant stress-regulated nucleotide sequence to the stress
condition. Accordingly, the invention also provides genetically modified plants, including
transgenic plants, produced by such a method, and a plant cell obtained from such genetically
modified plant, wherein said plant cell exhibits altered responsiveness to the stress condition;
a seed produced by a transgenic plant; and a cDNA library prepared from a transgenic plant.

Also provided is a method of modulating the responsiveness of a plant cell to a stress
condition. Such a method can be performed, for example, by introducing a functional portion
of a plant stress-regulated nucleotide sequence into the plant cell, thereby modulating the
responsiveness of the plant cell to a stress condition. As disclosed herein, the responsiveness
of the plant cell can be increased or decreased upon exposure to the stress condition, and the
altered responsiveness can result in increased or decreased tolerance of the plant cell to a
stress condition. The functional portion of the plant stress-regulated polynucleotide can, but
need not, be integrated into the genome of the plant cell, thereby modulating the
responsiveness of the plant cell to the stress condition. Accordingly, the invention also
provides a genetically modified plant, including a transgenic plant, which contains an
introduced polynucleotide portion of a plant stress-regulated nucleotide sequence, as well as
plant cells, tissues, and the like, which exhibit modulated responsiveness to a stress
condition.

The functional portion of the plant stress-regulated polynucleotide can encode a
stress-regulated polypeptide or functional peptide portion thereof, which can be operatively
linked to a heterologous promoter. As used herein, reference to a "functional peptide portion
of a plant stress-regulated polypeptide" means a contiguous amino acid sequence of the
polypeptide that has at least 50%, at least 75%, at least 90%, or at least 95% of the activity of
the full length polypeptide, or that has an antagonist activity with respect to the full length
polypeptide, or that presents an epitope unique to the polypeptide. Thus, by expressing a
functional peptide portion of a plant stress-regulated polypeptide in a plant cell, the peptide
can act as an agonist or an antagonist of the polypeptide, thereby modulating the
responsiveness of the plant cell to a stress condition. It should be noted that while the
functional peptide portion has an activity of the full length polypeptide, the activity need not
be of the same magnitude. Thus, the functional peptide portion can have an activity which is
greater or lesser in magnitude than that of the full length polypeptide.

A functional portion of the plant stress-regulated polynucleotide also can contain a
mutation, whereby upon integrating into the plant cell genome, the polynucleotide disrupts
(knocks out) an endogenous plant stress-regulated nucleotide sequence, thereby modulating
the responsiveness of said plant cell to the stress condition. Depending on whether the
knocked-out gene encodes an adaptive or a maladaptive stress-regulated polypeptide, the
responsiveness of the plant will be modulated accordingly. Thus, a method of the invention
provides a means of producing a transgenic plant having a knock-out phenotype of a plant
stress-regulated nucleotide sequence.

Alternatively, the responsiveness of a plant or plant cell to a stress condition can be
modulated by use of a suppressor construct containing a dominant negative mutation for any of
the stress-regulated polynucleotides described herein. Expression of a suppressor construct
containing a dominant negative mutation generates a mutant transcript that, when coexpressed
with the wild-type transcript, inhibits the action of the wild-type transcript. Methods for the
design and use of dominant negative constructs are well known in the art and can be found,
for example, in Herskowitz, Nature 329:219-222, 1987 and Lagna and Hemmati-Brivanlou,
Curr. Topics Devel. Biol. 36:75-98, 1998.

The functional portion of the plant stress-regulated polynucleotide sequence also can
comprise a stress-regulated regulatory element, which can be operatively linked to a
heterologous nucleotide sequence, which, upon expression from the regulatory element in
response to a stress condition, modulates the responsiveness of the plant cell to the stress
condition. Such a heterologous nucleotide sequence can encode, for example, a
stress-inducible transcription factor such as DREB1A, which, upon exposure to the stress condition,
is expressed such that it can amplify the stress response (see Kasuga et al., Nat. Biotechnol.
17:287-291, 1999). The heterologous nucleotide sequence also can encode a polynucleotide
that is specific for a plant stress-regulated nucleotide sequence, for example, an antisense
molecule, a ribozyme, or a triplexing agent, any of which, upon expression in the plant
cell, reduces or inhibits expression of a stress-regulated polypeptide encoded by the gene,
thereby modulating the responsiveness of the plant cell to a stress condition, for example, an
abnormal level of cold, osmotic pressure, or salinity. As used herein, the term "abnormal,"
when used in reference to a condition such as temperature, osmotic pressure, salinity, or any
other condition that can be a stress condition, means that the condition varies sufficiently
from a range generally considered optimum for growth of a plant that the condition results in
an induction of a stress response in a plant. Methods of determining whether a stress
response has been induced in a plant are disclosed herein or otherwise known in the art.

A plant stress-regulated regulatory element can be operatively linked to a heterologous
polynucleotide sequence, such that the regulatory element can be introduced into a plant genome
in a site-specific manner by homologous recombination. For example, a mutant plant
stress-regulated regulatory element for a maladaptive stress-induced polypeptide can be
transformed into a plant genome in a site-specific manner by in vivo mutagenesis, using a
hybrid RNA-DNA oligonucleotide ("chimeroplast"; TIBTECH 15:441-447, 1997;
WO 95/15972; Kren, Hepatology 25:1462-1468, 1997; Cole-Strauss, Science 273:1386-1389,
1996). Part of the DNA component of the RNA-DNA oligonucleotide is homologous to a
nucleotide sequence comprising the regulatory element of the maladaptive gene, but includes
a mutation or contains a heterologous region which is surrounded by the homologous regions.
By means of base pairing of the homologous regions of the RNA-DNA oligonucleotide and
of the endogenous nucleic acid molecule, followed by homologous recombination, the
mutation contained in the DNA component of the RNA-DNA oligonucleotide or the
heterologous region can be transferred to the plant genome, resulting in a "mutant" gene that,
for example, is not induced in response to a stress and, therefore, does not confer the
maladaptive phenotype. Such a method similarly can be used to knock out the activity of a
stress-regulated nucleotide sequence, for example, in an undesirable plant. Such a method
can provide the advantage that a desirable wild-type plant need not compete with the
undesirable plant, for example, for light, nutrients, or the like.

A method of modulating the responsiveness of a plant cell to a stress condition also
can be performed by introducing a mutation in the chromosomal copy of a plant
stress-regulated nucleotide sequence, for example, in the stress-regulated regulatory element, by
transforming a cell with a chimeric oligonucleotide composed of a contiguous stretch of
RNA and DNA residues in a duplex conformation with double hairpin caps on the ends. An
additional feature of the oligonucleotide is the presence of 2'-O-methylation at the RNA
residues. The RNA/DNA sequence is designed to align with the sequence of a chromosomal
copy of the target regulatory element and to contain the desired nucleotide change (see U.S.
Patent No. 5,501,967).

A plant stress-regulated regulatory element also can be operatively linked to a
heterologous polynucleotide that, upon expression from the regulatory element in the plant
cell, confers a desirable phenotype on the plant cell. For example, the heterologous
polynucleotide can encode an aptamer, which can bind to a stress-induced polypeptide.
Aptamers are nucleic acid molecules that are selected based on their ability to bind to and
inhibit the activity of a protein or metabolite. Aptamers can be obtained by the SELEX
(Systematic Evolution of Ligands by Exponential Enrichment) method (see U.S. Patent No.
5,270,163), wherein a candidate mixture of single stranded nucleic acids having regions of
randomized sequence is contacted with a target, and those nucleic acids having a specific
affinity to the target are partitioned from the remainder of the candidate mixture, and
amplified to yield a ligand enriched mixture. After several iterations a nucleic acid molecule
(aptamer) having optimal affinity for the target is obtained. For example, such a nucleic acid
molecule can be operatively linked to a plant stress-regulated regulatory element and
introduced into a plant. Where the aptamer is selected for binding to a polypeptide that
normally is expressed from the regulatory element and is involved in an adaptive response of
the plant to a stress, the recombinant molecule comprising the aptamer can be useful for
inhibiting the activity of the stress-regulated polypeptide, thereby decreasing the tolerance of
the plant to the stress condition.
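
The iterative enrichment underlying SELEX can be illustrated with a simple two-population calculation. The following Python sketch is not taken from the cited patent; it assumes, purely for illustration, that the candidate mixture contains only "binders" and "background" molecules, that each partitioning step retains a fixed fraction of each (the hypothetical parameters binder_retention and background_retention), and that amplification preserves the ratio of the two populations.

```python
def selex_enrichment(initial_fraction, binder_retention, background_retention, rounds):
    """Return the binder fraction after each round of partitioning and amplification.

    Toy model (assumption): two populations only, with fixed retention
    probabilities per round, and amplification that does not change their ratio.
    """
    fractions = []
    f = initial_fraction
    for _ in range(rounds):
        kept_binders = f * binder_retention
        kept_background = (1.0 - f) * background_retention
        # Amplification restores the pool size but keeps the ratio unchanged.
        f = kept_binders / (kept_binders + kept_background)
        fractions.append(f)
    return fractions


if __name__ == "__main__":
    # Hypothetical numbers: one binder per million molecules, and a
    # 100-fold retention advantage for binders in each partitioning step.
    for i, frac in enumerate(selex_enrichment(1e-6, 0.5, 0.005, 6), start=1):
        print(f"round {i}: binder fraction = {frac:.6f}")
```

Under these assumed values the odds in favor of binders rise 100-fold per round, so a sequence that starts at one part per million dominates the pool after four rounds, which is consistent with the statement that only "several iterations" are needed.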

The invention provides a genetically modified plant, which can be a transgenic plant,
that is tolerant or resistant to a stress condition. As used herein, the term "tolerant" or
"resistant," when used in reference to a stress condition of a plant, means that the particular
plant, when exposed to a stress condition, shows less of an effect, or no effect, in response to
the condition as compared to a corresponding reference plant (naturally occurring wild-type
plant or a plant not containing a construct of the present invention). As a consequence, a
plant encompassed within the present invention grows better under more widely varying
conditions, has higher yields and/or produces more seeds. Thus, a transgenic plant produced
according to a method of the invention can demonstrate protection (as compared to a
corresponding reference plant) from a delay to complete inhibition of alteration in cellular
metabolism, or reduced cell growth or cell death caused by the stress. Preferably, the
transgenic plant is capable of substantially normal growth under environmental conditions
where the corresponding reference plant shows reduced growth, metabolism or viability, or
increased male or female sterility.

The determination that a plant modified according to a method of the invention has
increased resistance to a stress-inducing condition can be made by comparing the treated
plant with a control (reference) plant using well known methods. For example, a plant
having increased tolerance to saline stress can be identified by growing the plant on a
medium such as soil, which contains a higher content of salt, on the order of at least about
10%, compared to a medium the corresponding reference plant is capable of growing on.
Advantageously, a plant treated according to a method of the invention can grow on a
medium or soil containing at least about 50%, or more than about 75%, or more than about
100%, or more than about 200% more salt than the medium or soil on which a corresponding
reference plant can grow. In particular, such a treated plant can grow on medium or soil
containing at least 40 mM, at least 100 mM, at least 200 mM, or at least 300 mM salt,
including, for example, a water soluble inorganic salt such as sodium sulfate, magnesium
sulfate, calcium sulfate, sodium chloride, magnesium chloride, calcium chloride, potassium
chloride, or the like; salts of agricultural fertilizers, and salts associated with alkaline or acid
soil conditions; particularly NaCl.
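
The relative and absolute salt thresholds recited above can be checked with simple arithmetic. The following Python sketch is illustrative only; the function names are hypothetical, the qualifiers "at least about" and "more than about" are both simplified to a "greater than or equal to" comparison, and the example concentrations are assumed values, not data from the Examples.

```python
# Thresholds recited in the text; treating every one as ">=" is a simplification.
PERCENT_THRESHOLDS = (10, 50, 75, 100, 200)
ABSOLUTE_THRESHOLDS_MM = (40, 100, 200, 300)


def percent_more_salt(treated_mm, reference_mm):
    """Percent by which the treated plant's tolerated salt level exceeds the reference."""
    if reference_mm <= 0:
        raise ValueError("reference concentration must be positive")
    return 100.0 * (treated_mm - reference_mm) / reference_mm


def thresholds_met(treated_mm, reference_mm):
    """List which relative and absolute thresholds a treated plant satisfies."""
    pct = percent_more_salt(treated_mm, reference_mm)
    return {
        "percent_more": pct,
        "percent_thresholds": [t for t in PERCENT_THRESHOLDS if pct >= t],
        "absolute_thresholds_mM": [t for t in ABSOLUTE_THRESHOLDS_MM if treated_mm >= t],
    }


if __name__ == "__main__":
    # Hypothetical case: reference plant grows at up to 100 mM NaCl,
    # treated plant grows at up to 200 mM NaCl.
    print(thresholds_met(200.0, 100.0))
```

In this assumed case the treated plant tolerates 100% more salt than the reference plant and satisfies the 40 mM, 100 mM and 200 mM absolute levels, but not the 200% or 300 mM levels.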

In another embodiment, the invention provides a plant that is less tolerant or less
resistant to a stress condition as compared to a corresponding reference plant. As used
herein, the term "less tolerant" or "less resistant," when used in reference to a stress condition
of a plant, means that the particular plant, when exposed to a stress condition, shows an
alteration in response to the condition as compared to a corresponding reference plant. In one
embodiment, the alteration in response is at least 5%, in another at least 10% and in still
another at least 25% when compared to the reference plant. As a consequence, such a plant,
which generally is an undesirable plant species, is less likely to grow when exposed to a
stress condition than an untreated plant.

The present invention also relates to a method of expressing a heterologous nucleotide
sequence in a plant cell. Such a method can be performed, for example, by introducing into
the plant cell a plant stress-regulated regulatory element operatively linked to the
heterologous nucleotide sequence, whereby, upon exposure of the plant cell to a stress
condition, the heterologous nucleotide sequence is expressed in the plant cell. The
heterologous nucleotide sequence can encode a selectable marker, or a polypeptide that
confers a desirable trait upon the plant cell, for example, a polypeptide that improves the
nutritional value, digestibility or ornamental value of the plant cell, or a plant comprising the
plant cell. Accordingly, the invention provides a transgenic plant that, in response to a stress
condition, can produce a heterologous polypeptide from a plant stress-regulated regulatory
element. Such transgenic plants can provide the advantage that, when grown in a cold
environment for example, expression of the heterologous polypeptide from a plant
cold-regulated regulatory element can result in increased nutritional value of the plant.

The present invention further relates to a method of modulating the activity of a
biological pathway in a plant cell, wherein the pathway involves a stress-regulated
polypeptide. As used herein, reference to a pathway that "involves" a stress-regulated
polypeptide means that the polypeptide is required for normal function of the pathway. For
example, plant stress-regulated polypeptides as disclosed herein include those acting as
kinases or as transcription factors, which are well known to be involved in signal
transduction pathways. As such, a method of the invention provides a means to modulate
biological pathways involving plant stress-regulated polypeptides, for example, by altering
the expression of the polypeptides in response to a stress condition. Thus, a method of the
invention can be performed, for example, by introducing a polynucleotide portion of a plant
stress-regulated nucleotide sequence into the plant cell, thereby modulating the activity of the
biological pathway. A method of the invention can be performed with respect to a pathway
involving any of the stress-regulated polypeptides encoded by a polynucleotide
disclosed herein, including for example, a stress-regulated transcription factor, an enzyme,
including a kinase, a channel protein, or the like.

The present invention also relates to a method of identifying a polynucleotide that
modulates a stress response in a plant cell. Such a method can be performed, for example, by
contacting an array of probes representative of a plant cell genome and nucleic acid
molecules expressed in a plant cell exposed to the stress; detecting a nucleic acid molecule that
is expressed at a level different from a level of expression in the absence of the stress;
introducing the nucleic acid molecule that is expressed differently into a plant cell; and
detecting a modulated response of the plant cell containing the introduced nucleic acid
molecule to a stress, thereby identifying a polynucleotide that modulates a stress response in
a plant cell. The contacting is under conditions that allow for specific hybridization of a
nucleic acid molecule with a probe having sufficient complementarity, for example, under
stringent or highly stringent hybridization conditions.
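
The detecting step of such a method, in which probes reporting a different expression level under stress are selected, can be sketched as a fold-change comparison. The following Python example is a minimal illustration under stated assumptions: the probe identifiers and signal values are hypothetical, and the two-fold cutoff (min_fold) and the signal floor are assumed parameters that this passage does not specify.

```python
import math


def differentially_expressed(stressed, control, min_fold=2.0, floor=1.0):
    """Return {probe: log2(stressed/control)} for probes changed by at least min_fold.

    `stressed` and `control` map probe identifiers to hybridization signals.
    `min_fold` and `floor` are illustrative assumptions, not values from the text.
    """
    cutoff = math.log2(min_fold)
    hits = {}
    for probe, stressed_signal in stressed.items():
        if probe not in control:
            continue  # a probe measured in only one condition cannot be compared
        s = max(stressed_signal, floor)  # floor avoids division by near-zero signals
        c = max(control[probe], floor)
        log2_ratio = math.log2(s / c)
        if abs(log2_ratio) >= cutoff:
            hits[probe] = log2_ratio
    return hits


if __name__ == "__main__":
    # Hypothetical signals for three probes.
    stressed = {"probe_A": 800.0, "probe_B": 100.0, "probe_C": 25.0}
    control = {"probe_A": 100.0, "probe_B": 110.0, "probe_C": 100.0}
    print(differentially_expressed(stressed, control))
```

With these assumed signals, probe_A (induced eight-fold) and probe_C (repressed four-fold) are selected as candidates for introduction into a plant cell, whereas probe_B is essentially unchanged and is excluded.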

As used herein, the term "array of probes representative of a plant cell genome"
means an organized group of oligonucleotide probes that are linked to a solid support, for
example, a microchip or a glass slide, wherein the probes can hybridize specifically and
selectively to nucleic acid molecules expressed in a plant cell. Such an array is exemplified
herein by a GeneChip® Arabidopsis Genome Array (Affymetrix; see Examples). In general, an
array of probes that is "representative" of a plant genome will identify at least about 30% of
the expressed nucleic acid molecules in a plant cell, at least about 50% or 70%, at least about
80% or 90%, or will identify all of the expressed nucleic acid molecules. It should be
recognized that the greater the representation, the more likely all nucleotide sequences of
a cluster of stress-regulated nucleotide sequences will be identified.
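
The degree to which an array is "representative" can be expressed as the fraction of expressed sequences that at least one probe on the array can identify. The following Python sketch is illustrative only; the gene identifiers are hypothetical, and the function names are introduced here for the example.

```python
# Representation levels recited in the text, as fractions.
REPRESENTATION_TIERS = (0.30, 0.50, 0.70, 0.80, 0.90, 1.00)


def representation(array_targets, expressed):
    """Fraction of expressed sequences that are targeted by the array."""
    expressed = set(expressed)
    if not expressed:
        raise ValueError("no expressed sequences supplied")
    return len(expressed & set(array_targets)) / len(expressed)


def highest_tier(fraction):
    """Highest recited representation level met, or None if below 30%."""
    met = [tier for tier in REPRESENTATION_TIERS if fraction >= tier]
    return max(met) if met else None


if __name__ == "__main__":
    # Hypothetical example: 10 expressed genes, 8 of which have probes on the array.
    expressed = [f"gene_{i}" for i in range(10)]
    array_targets = [f"gene_{i}" for i in range(8)] + ["gene_x"]
    frac = representation(array_targets, expressed)
    print(frac, highest_tier(frac))
```

In this assumed case the array identifies 8 of 10 expressed sequences and therefore meets the 80% level but not the 90% level.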

In addition, any polynucleotide of the present disclosure can be used for diagnostic
purposes or to find related stress-responsive sequences in other species. Any polynucleotide
provided herein may be attached in overlapping areas or at random locations on the solid
support. Alternatively, the polynucleotides of the invention may be attached in an ordered
array wherein each polynucleotide is attached to a distinct region of the solid support that
does not overlap with the attachment site of any other polynucleotide. In one instance, such
an ordered array of polynucleotides is designed to be "addressable" where the distinct
locations are recorded and can be accessed as part of an assay procedure. Addressable
polynucleotide arrays typically include a plurality of different oligonucleotide probes that are
coupled to a surface of a substrate in different known locations. The knowledge of the
precise location of each polynucleotide makes these "addressable" arrays
particularly useful in hybridization assays. Any addressable array technology known in the
art can be employed with the polynucleotides of the invention. One particular embodiment
of these polynucleotide arrays is known as Genechips™, and has been generally described
in US Patent 5,143,854 and PCT publications WO 90/15070 and WO 92/10092. These arrays
may generally be produced using mechanical synthesis methods or light directed synthesis
methods that incorporate a combination of photolithographic methods and solid phase
oligonucleotide synthesis. The immobilization of arrays of oligonucleotides on solid
supports has been rendered possible by the development of a technology generally identified
as "Very Large Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, typically,
probes are immobilized in a high density array on a solid surface of a chip. Examples of
VLSIPS™ technologies are provided in US Patents 5,143,854 and 5,412,087 and in PCT
Publications WO 90/15070, WO 92/10092 and WO 95/11995, which describe methods for
forming oligonucleotide arrays through techniques such as light-directed synthesis.
Further presentation strategies aimed at providing arrays of nucleotides
immobilized on solid supports were developed to order and display the oligonucleotide arrays
on the chips in an attempt to maximize hybridization patterns and sequence information as
disclosed in PCT Publications WO 94/12305, WO 94/11530, WO 97/29212 and WO
97/31256.

In another embodiment, an oligonucleotide probe matrix may advantageously be used
to detect mutations occurring in a polynucleotide disclosed herein. For this particular
purpose, probes are specifically designed to have a nucleotide sequence allowing their
hybridization to the genes that carry known mutations (either by deletion, insertion or
substitution of one or several nucleotides). By "known mutations" is meant mutations of a
polynucleotide, including any of those disclosed herein, that have been identified using
techniques known in the art.

Another technique that is used to detect mutations in a polynucleotide, including any
stress-responsive sequence disclosed herein, is the use of a high-density DNA array;
single base mutations are encompassed by this technique. Each oligonucleotide probe
constituting a unit element of the high density DNA array is designed to match a specific
subsequence of the genomic DNA or cDNA of interest. Thus, an array containing
oligonucleotides complementary to subsequences of the target gene sequence is used to
determine the identity of the target sequence with the "wild-type" nucleotide sequence,
measure its amount, and detect differences between the target sequence and the reference
wild-type nucleotide sequence. One such design, termed a "4L tiled array," is implemented
using a set of four probes (A, C, G, T), for example 15-nucleotide oligomers. In each set of
four probes, the perfect complement will hybridize more strongly than mismatched probes.
Consequently, a nucleotide target of length L is scanned for mutations with a tiled array
containing 4L probes, the whole probe set containing all the possible mutations in the known
wild-type reference sequence. The hybridization signals of the 15-mer probe set tiled array are
perturbed by a single base change in the target sequence. As a consequence, there is a
characteristic loss of signal or a "footprint" for the probes flanking a mutation position. This
technique was described by Chee et al. (Science 274:610, 1996).
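
As a non-limiting sketch of how a 4L probe set of the kind described above might be enumerated, the following example generates four 15-mer probes for each position of a reference sequence. It is an illustration under stated assumptions, not the design procedure of Chee et al.: the function names are hypothetical, the interrogated base is placed at the center of the probe where possible, and the window is shifted at the ends of the reference so that every one of the L positions is covered and exactly 4L probes result.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def tiled_4l_probes(reference, probe_len=15):
    """Enumerate a 4L probe set for a wild-type reference sequence.

    For each position of the reference, four probes are produced that are
    complementary to the surrounding window and differ only at the
    interrogated position (target base A, C, G or T).
    Returns a list of (position, target_base, probe_sequence) tuples.
    """
    length = len(reference)
    if length < probe_len:
        raise ValueError("reference must be at least probe_len nucleotides")
    half = probe_len // 2
    probes = []
    for pos in range(length):
        # Center the window on pos; shift it inward near the sequence ends
        # (an assumption of this sketch, so that all L positions are tiled).
        start = min(max(pos - half, 0), length - probe_len)
        window = reference[start:start + probe_len]
        offset = pos - start
        for base in "ACGT":
            variant = window[:offset] + base + window[offset + 1:]
            probes.append((pos, base, reverse_complement(variant)))
    return probes


if __name__ == "__main__":
    ref = "ACGTTGCAAGCTTAGGCTAC"
    probe_set = tiled_4l_probes(ref)
    print(len(ref), len(probe_set))
```

In each set of four, the probe whose target base equals the reference base is the perfect complement of the wild-type sequence; the other three carry a single mismatch at the interrogated position.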

Polynucleotides identified herein include those nucleotide sequences that are induced
or repressed in response to a combination of stress conditions, but not to any of the stress
conditions alone; and polynucleotides that are induced or repressed in response to a selected
stress condition, but not to other stress conditions. Furthermore, polynucleotides whose
response to a stress condition is temporally regulated are also included, such as
polynucleotides that are induced early, late or continuously in a stress response. In addition,
the polynucleotides are represented by a variety of cellular proteins, including transcription
factors, enzymes such as kinases, channel proteins, and the like.
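
The two categories described above (sequences responsive only to a combination of stresses, and sequences responsive only to one selected stress) can be expressed as simple set operations. The following is a minimal sketch, assuming that the sets of responsive sequence identifiers for each condition have already been determined; the function names and identifiers are hypothetical.

```python
def stress_specific(singles):
    """For each single stress, return the sequences that respond to that
    stress but to none of the other single stresses.

    singles: dict mapping stress name -> set of responsive sequence IDs.
    """
    result = {}
    for stress, ids in singles.items():
        others = set()
        for other_stress, other_ids in singles.items():
            if other_stress != stress:
                others |= other_ids
        result[stress] = ids - others
    return result


def combination_only(singles, combined):
    """Return the sequences that respond to the combined stress treatment
    but to none of the single stresses applied alone.

    combined: set of sequence IDs responsive to the combination.
    """
    any_single = set()
    for ids in singles.values():
        any_single |= ids
    return combined - any_single


if __name__ == "__main__":
    singles = {"cold": {"s1", "s2", "s4"}, "salt": {"s2", "s3"}}
    combined = {"s1", "s2", "s5"}
    print(stress_specific(singles))
    print(combination_only(singles, combined))
```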

The present invention additionally relates to a method of identifying a stress condition
to which a plant cell was exposed. Such a method can be performed, for example, by
contacting nucleic acid molecules expressed in the plant cell with an array of probes
representative of the plant cell genome; and detecting a profile of expressed nucleic acid
molecules characteristic of a stress response, thereby identifying the stress condition to which
the plant cell was exposed. The contacting generally is under conditions that allow for
specific hybridization of a nucleic acid molecule with probes having sufficient
complementarity, for example, under stringent or highly stringent hybridization conditions.
The profile can be characteristic of exposure to a single stress condition, for example, an
abnormal level of cold, osmotic pressure, or salinity, or can be characteristic of exposure to
more than one stress condition, for example, cold, increased osmotic pressure and increased
salinity.

The polynucleotides for which expression is determined, and so the probes used, may
be varied depending on the particular plant and/or stress involved. In one embodiment, the
plant is a cereal and expression is determined for at least one polynucleotide selected from
the group consisting of those sequences disclosed herein. In another embodiment, the plant is
a rice plant. It will be apparent to those of skill in the art that, through the use of various
technologies, for example microarrays, it is possible and in many cases desirable to
determine the expression of multiple stress-regulated polynucleotides at once. Thus, the
preceding various embodiments include the identification of a stress condition to which a
plant was exposed in which expression data is obtained on at least 10, at least 25, at least 50,
at least 100, at least 250, at least 500, or at least 750 of the various groups of polynucleotide
sequences described above.

In one embodiment, the expression profile is produced by isolating RNA, for example
mRNA, from the test plant. Methods for the isolation of RNA from plants are well known in
the art and can be found in standard reference texts such as those cited herein. In one
embodiment, the RNA is converted into cDNA by the use of reverse transcriptase using
protocols that are well known to those skilled in the art of molecular biology. The RNA or
cDNA is then hybridized to probes for the stress-regulated polynucleotides described herein
under stringent, high stringency, or very high stringency conditions and hybridization is
detected. It is envisioned that multiple probes will be used for each polynucleotide expressed
and that expression of multiple polynucleotides will be determined as detailed above.

The method can be used to determine exposure to any stress that results in
altered expression of the described polynucleotide sequences. In one embodiment, the stress
is a single or combination abiotic stress such as cold stress, saline stress, osmotic stress or
any combination thereof. In one embodiment, the expression profile from the test plant is
also compared to that of a control plant of the same species, for example an isogenic plant, that has
not been exposed to a stress.

In one embodiment of the invention, nucleic acid samples collected from the plant cells
can be contacted with an array, then the profile can be compared with known
profiles prepared from nucleic acid samples of plants exposed to known stresses. By creating
a panel of such profiles, representative of various stress conditions, an unknown stress
condition to which a plant was exposed can be identified simply by comparing the unknown
profile with the known profiles and determining which known profile matches the
unknown profile. In one embodiment, the comparison is automated. Such a method can be
useful, for example, to identify a cause of damage to a crop, where the condition causing the
stress is not known or gradually increases over time. For example, accumulation in soils over
time of salts from irrigation water can result in gradually decreasing crop yields. Because the
accumulation is gradual, the cause of the decreased yield may not be readily apparent. Using
the present methods, it is possible to evaluate the stress to which the plants are exposed, thus
revealing the cause of the decreased yields.
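
One way the automated comparison mentioned above might be carried out is sketched below, for illustration only. The sketch assumes that each profile is stored as a mapping from a sequence identifier to a log expression ratio, and it scores the unknown profile against each known profile by Pearson correlation over the sequences they share. The choice of correlation as the matching criterion, the function names, and the example values are assumptions of this sketch and are not specified in the present disclosure.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length lists of numbers."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat profile carries no pattern to match
    return cov / math.sqrt(var_x * var_y)


def best_matching_stress(unknown, panel):
    """Compare an unknown profile with a panel of known stress profiles.

    unknown: dict mapping sequence ID -> log expression ratio.
    panel: dict mapping stress name -> profile dict of the same form.
    Returns (best_stress_name, {stress_name: correlation}).
    """
    scores = {}
    for stress, reference in panel.items():
        shared = sorted(set(unknown) & set(reference))
        if len(shared) < 2:
            continue  # too few shared sequences to compare
        scores[stress] = pearson(
            [unknown[s] for s in shared],
            [reference[s] for s in shared],
        )
    if not scores:
        raise ValueError("no known profile shares enough sequences")
    best = max(scores, key=scores.get)
    return best, scores


if __name__ == "__main__":
    panel = {
        "cold": {"g1": 2.0, "g2": -1.0, "g3": 0.0, "g4": 1.5},
        "salt": {"g1": -1.0, "g2": 2.0, "g3": 1.0, "g4": -0.5},
    }
    unknown = {"g1": 1.8, "g2": -0.9, "g3": 0.2, "g4": 1.2}
    print(best_matching_stress(unknown, panel))
```

In practice a minimum score would presumably be required before a match is reported, since the best of several poor correlations does not by itself identify a stress.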

The present invention therefore includes a computer-readable medium containing
executable instructions for receiving expression data for sequences substantially similar to
any of those disclosed herein and comparing expression data from a test plant to a reference
plant that has been exposed to an abiotic stress. Also provided is a computer-readable
medium containing sequence data for sequences substantially similar to any of the sequences
described herein, or the complements thereof, and a module for comparing such sequences to
other nucleic acid sequences.

Also provided are plants and plant cells comprising plant stress-regulatory elements
of the present invention operably linked to a nucleotide sequence encoding a detectable
signal. Such plants can be used as diagnostic or "sentinel plants" to provide early warning
that nearby plants are being stressed so that appropriate actions can be taken. In one
embodiment, the signal is one that alters the appearance of the plant. For example, an
osmotic stress regulatory element of the present invention can be operably linked to a
nucleotide sequence encoding a fluorescent protein such as a green fluorescent protein, a
yellow fluorescent protein, a cyan fluorescent protein, or a red fluorescent protein. When
the sentinel plant is subjected to osmotic stress, the expression of the green fluorescent protein
in the plant provides a visible signal so that appropriate actions can be taken to remove or alleviate
the stress. The use of fluorescent proteins in plants is known in the art and can be found, for
example, in Leffel et al., Biotechniques 23:912, 1997.



WO 03/008540 PCT/US02/19668 

The invention further relates to a method of identifying an agent that modulates the
activity of a stress-regulated regulatory element of a plant. As used herein, the term
"modulate the activity," when used in reference to a plant stress-regulated regulatory element,
means that expression of a nucleotide sequence from the regulatory element is increased or
decreased. In particular, expression can be increased or decreased with respect to the basal
activity of the promoter, i.e., the level of expression, if any, in the absence of a stress
condition that normally induces expression from the regulatory element; or can be increased
or decreased with respect to the level of expression in the presence of the inducing stress
condition. As such, an agent can act as a mimic of a stress condition, or can act to modulate
the response to a stress condition.

Such a method can be performed, for example, by contacting the regulatory element 
with an agent suspected of having the ability to modulate the activity of the regulatory 
element, and detecting a change in the activity of the regulatory element. In one 
embodiment, the regulatory element can be operatively linked to a heterologous
polynucleotide encoding a reporter molecule, and an agent that modulates the activity of the
stress-regulated regulatory element can be identified by detecting a change in expression of 
the reporter molecule due to contacting the regulatory element with the agent. Such a method 
can be performed in vitro in a plant cell-free system, or in a plant cell in culture or in a plant 
in situ. 

A method of the invention also can be performed by contacting the agent with a
genetically modified cell or a transgenic plant containing an introduced plant stress-regulated
regulatory element, and an agent that modulates the activity of the regulatory element is 
identified by detecting a phenotypic change in the modified cell or transgenic plant. 

A method of the invention can be performed in the presence or absence of the stress
condition to which the particular regulatory element is responsive. As such, the method
can identify an agent that modulates the activity of a plant stress-regulated promoter in
response to the stress, for example, an agent that can enhance the stress response or can
reduce the stress response. In particular, a method of the invention can identify an agent that
selectively activates the stress-regulated regulatory elements of a cluster of plant
stress-regulated nucleotide sequences, but does not affect the activity of other stress-regulated
regulatory polynucleotides. As such, the method provides a means to identify an agent that
acts as a stress mimic. Such agents can be particularly useful to prepare a plant for an
expected stress condition. For example, an agent that acts as a cold mimic can be applied to a
field of plants prior to the arrival of an expected cold front. Thus, the cold stress response
can be induced prior to the actual cold weather, thereby providing the plants with the
protection of the stress response, without the plants suffering from any initial damage due to
the cold. Similarly, an osmotic pressure mimic can be applied to a crop of plants prior to a field
being flooded by a rising river.

In one embodiment, the present invention provides a method for marker-assisted
selection. Marker-assisted selection is a well-known method in the art and involves the
selection of plants having desirable phenotypes based on the presence of particular nucleotide
sequences known as markers. The use of markers allows plants to be selected early in
development, often before the phenotype would normally manifest itself. Because it allows
for early selection, marker-assisted selection decreases the amount of time needed for selection
and thus allows more rapid genetic progress. Briefly, marker-assisted selection involves
obtaining nucleic acid from a plant to be selected. The nucleic acid obtained is then probed
with probes that selectively hybridize under stringent or highly stringent conditions to a
nucleotide sequence or sequences associated with the desired phenotype. In one
embodiment, the probes hybridize to any of the stress-responsive nucleotide sequences or
regulatory elements disclosed herein. The presence of any hybridization products formed is
detected and plants are then selected on the presence or absence of the hybridization
products.

The isolated polynucleotides of the invention can be used to create various types of
genetic and physical maps of the genome of rice or other plants. Such maps are used to
devise positional cloning strategies for isolating novel genes from the mapped crop species.
The sequences of the present invention are also useful for chromosome mapping,
chromosome identification, tagging genes of known and useful function, tagging genes to
which a function has not yet been assigned, and for other uses, including those set forth in
U.S. Patent No. 5,817,479.

In addition, because the genomes of closely related species are largely syntenic (that 
is, they display the same ordering of genes within the genome), these maps can be used to 
isolate novel alleles from wild relatives of crop species by positional cloning strategies. This 
shared synteny is very powerful for using genetic maps from one species to map genes in
another. For example, a gene mapped in rice provides information for the gene location in
maize and wheat. 

In one embodiment, the stress responsive sequences of the present invention are
located in and can be used to identify Quantitative Trait Loci (QTLs) for a variety of uses,
including marker-assisted breeding. Many important crop traits are quantitative traits and
result from the combined interactions of several genes. These genes reside at different loci in
the genome, often on different chromosomes, and generally exhibit multiple alleles at each
locus. Developing markers, tools, and methods to identify and isolate the QTLs involved in a
trait enables marker-assisted breeding to enhance desirable traits or suppress undesirable
traits. The sequences of the invention can be used to identify QTLs and isolate alleles as
described by Li et al. in a study of QTLs involved in resistance to a pathogen of rice (Li et
al., Mol Gen Genet. 261:58, 1999).

In particular, SEQ ID Nos. listed in Table 1 have been mapped to a drought resistance
QTL located on chromosome 8 of rice (Zhang et al., Theor. Appl. Genet. 103:19-29, 2000).
This QTL is syntenic with drought resistance QTLs located on wheat chromosome 7S and
barley chromosome 1. In addition to supporting the role of these sequences in drought
resistance, these data support a role for these sequences in drought resistance in a
variety of cereals. Likewise, additional SEQ ID Nos. listed in Table 1 are located in the
drought resistance QTL on chromosome 3 of rice (Zhang et al., Theor. Appl. Genet. 103:19-29,
2000). This QTL is syntenic with a drought resistance QTL located on maize
chromosome 1, again supporting the role of the polynucleotides in drought resistance in a
variety of species.

In addition to isolating QTL alleles in rice, other cereals, and other monocot and dicot 
crop species, the sequences of the invention can also be used to isolate alleles from the 
corresponding QTL(s) of wild relatives. Transgenic plants having various combinations of 
QTL alleles can then be created and the effects of the combinations measured. Once an ideal 
allele combination has been identified, crop improvement can be accomplished either through 
biotechnological means or by directed conventional breeding programs (Flowers et al., J
Exp Bot. 51:99, 2000; Tanksley and McCouch, Science 277:1063, 1997).

Polynucleotides derived from sequences of the present invention are useful to detect 
the presence in a test sample of at least one copy of a nucleotide sequence containing the 
same or substantially the same sequence, or a fragment, complement, or variant thereof. The 
sequence of the probes and/or primers of the instant invention need not be identical to those 
provided in the Sequence Listing or the complements thereof. Some variation in probe or 
primer sequence and/or length can allow additional family members to be detected, as well as 
orthologous genes and more taxonomically distant related sequences. Similarly, probes
and/or primers of the invention can include additional nucleotides that serve as a label for
detecting duplexes, for isolation of duplexed polynucleotides, or for cloning purposes.

Probes and primers of the invention include isolated, purified, or recombinant
polynucleotides containing a contiguous span of between at least 12 to at least 1000
nucleotides of any of the sequences disclosed herein or the complements thereof, with each
individual number of nucleotides within this range also being part of the invention.
Examples are isolated, purified, or recombinant polynucleotides containing a contiguous span
of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 300, 400, 500, 750,
or 1000 nucleotides of any of the sequences disclosed herein or the complements thereof.
The appropriate length for primers and probes will vary depending on the application. For
use as PCR primers, probes are 12-40 nucleotides, typically 18-30 nucleotides long. For use
in mapping, probes are 50 to 500 nucleotides, typically 100-250 nucleotides long. For use in
Southern hybridizations, probes as long as several kilobases can be used. The appropriate
length for primers and probes under a particular set of assay conditions may be empirically
determined by one of skill in the art.

The primers and probes can be prepared by any suitable method, including, for
example, cloning and restriction of appropriate sequences and direct chemical synthesis by a
method such as the phosphodiester method of Narang et al. (Meth Enzymol. 68: 90, 1979),
the diethylphosphoramidite method, the triester method of Matteucci et al. (J Am Chem Soc.
103: 3185, 1981), or according to Urdea et al. (Proc Natl Acad Sci. USA. 80: 7461, 1981),
the solid support method described in EP 0 707 592, or using commercially available
automated oligonucleotide synthesizers.

Detection probes are generally nucleotide sequences or uncharged nucleotide analogs
such as, for example, peptide nucleic acids, which are disclosed in International Patent
Application WO 92/20702, and morpholino analogs, which are described in U.S. Patent Nos.
5,185,444, 5,034,506 and 5,142,047. The probe may have to be rendered "non-extendable"
such that additional dNTPs cannot be added to the probe. Analogs are usually
non-extendable, and nucleotide probes can be rendered non-extendable by modifying the 3' end
of the probe such that the hydroxyl group is no longer capable of participating in elongation.
For example, the 3' end of the probe can be functionalized with the capture or detection label
to thereby consume or otherwise block the hydroxyl group. Alternatively, the 3' hydroxyl
group simply can be cleaved, replaced or modified so as to render the probe non-extendable.
Any of the polynucleotides of the present invention can be labeled, if desired, by
incorporating a label detectable by spectroscopic, photochemical, biochemical,
immunochemical, or chemical means. For example, useful labels include radioactive
substances (32P, 35S, 3H, 125I), fluorescent dyes (5-bromodesoxyuridine, fluorescein,
acetylaminofluorene, digoxigenin) or biotin. In one embodiment, polynucleotides are labeled
at their 3' and 5' ends. Examples of non-radioactive labeling of nucleotide fragments are
described in the French patent No. FR-7810975 and by Urdea et al. (Nuc Acids Res. 16:4937,
1988). In addition, the probes according to the present invention may have structural
characteristics such that they allow signal amplification, such structural characteristics
being, for example, branched DNA probes as described in EP 0 225 807.

A label can also be used to capture the primer so as to facilitate the immobilization of
either the primer or a primer extension product, such as amplified DNA, on a solid support.
A capture label is attached to the primers or probes and can be a specific binding member that
forms a binding pair with the solid phase reagent's specific binding member, for example
biotin and streptavidin. Therefore, depending upon the type of label carried by a
polynucleotide or a probe, it may be employed to capture or to detect the target DNA.
Further, it will be understood that the polynucleotides, primers or probes provided herein
may, themselves, serve as the capture label. For example, in the case where a solid phase
reagent's binding member is a nucleotide sequence, it may be selected such that it binds a
complementary portion of a primer or probe to thereby immobilize the primer or probe to the
solid phase. In cases where a polynucleotide probe itself serves as the binding member, those
skilled in the art will recognize that the probe will contain a sequence or "tail" that is not
complementary to the target. In the case where a polynucleotide primer itself serves as the
capture label, at least a portion of the primer will be free to hybridize with a nucleotide on a
solid phase. DNA labeling techniques are well known in the art.

Any of the polynucleotides, primers and probes of the present invention can be
conveniently immobilized on a solid support. Solid supports are known to those skilled in
the art and include the walls of wells of a reaction tray, test tubes, polystyrene beads,
magnetic beads, nitrocellulose strips, membranes, microparticles such as latex particles,
sheep (or other animal) red blood cells, duracytes and others. The solid support is not critical
and can be selected by one skilled in the art. Thus, latex particles, microparticles, magnetic
or non-magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon
chips, sheep (or other suitable animal's) red blood cells and duracytes are all suitable
examples. Suitable methods for immobilizing nucleotides on solid phases include ionic,
hydrophobic, covalent interactions and the like. A solid support, as used herein, refers to any
material that is insoluble, or can be made insoluble by a subsequent reaction. The solid
support can be chosen for its intrinsic ability to attract and immobilize the capture reagent.
Alternatively, the solid phase can retain an additional receptor that has the ability to attract
and immobilize the capture reagent. The additional receptor can include a charged substance
that is oppositely charged with respect to the capture reagent itself or to a charged substance
conjugated to the capture reagent. As yet another alternative, the receptor molecule can be
any specific binding member which is immobilized upon (attached to) the solid support and
which has the ability to immobilize the capture reagent through a specific binding reaction.
The receptor molecule enables the indirect binding of the capture reagent to a solid support
material before the performance of the assay or during the performance of the assay. The
solid phase thus can be a plastic, derivatized plastic, magnetic or non-magnetic metal, glass
or silicon surface of a test tube, microtiter well, sheet, bead, microparticle, chip, sheep (or
other suitable animal's) red blood cells, duracytes and other configurations known to those of
ordinary skill in the art. The polynucleotides of the invention can be attached to or
immobilized on a solid support individually or in groups of at least 2, 5, 8, 10, 12, 15, 20, or
25 distinct polynucleotides of the invention to a single solid support. In addition,
polynucleotides other than those of the invention may be attached to the same solid support
as one or more polynucleotides of the invention.

Probes and primers of the invention can be used to identify and/or isolate
polynucleotides related to the stress responsive sequences provided in the Sequence Listing,
or allelic variants of the stress responsive sequences. Generally, related polynucleotides have
similar sequences or encode polypeptides with similar biological activity, but are found at
other loci within an organism, or are found in other organisms. Identification and isolation of
related sequences can provide important tools for functional genomics, to study the evolution
of genomes, and to predict gene and protein function, interaction, and regulation. Related
sequences including paralogs and orthologs are particularly important in identifying Clusters
of Orthologous Groups (COGs) of proteins, which aids protein function prediction and the
functional and phylogenetic annotation of newly sequenced genomes.

Hybridization of the stress-responsive sequences of the invention to nucleotides
obtained from other organisms can be used to identify and isolate paralogous sequences, or
paralogs, which are additional members of gene families. The terms "paralogous sequence"
and "paralog" as used herein encompass both full-length genes and regions and fragments
thereof. Paralogs may be located in the same region of the genome in which the
sequence used as a probe is located, or in a different region. Paralogs generally have a high
sequence identity with
the probe sequence or the gene from which the probe was prepared; however, paralogs may
have overall sequence identity with a probe sequence as low as 20 to 30% and still be
recognizable as members of the same gene family with similar functions, as reported by
Takata et al. for RAD51B paralogs (Mol Cell Biol, 20:6476, 2000). When overall sequence
identity is not high, the sequence similarity among paralogs of a gene family is often
concentrated into one or a few portions of the sequence, notably in a portion encoding a
protein or RNA that has an enzymatic or structural function. The degree of identity in the
amino acid sequence of the domain that defines the gene family can be as low as 20%, but is
often at least 50%, at least 75%, at least 80 to 95%, or at least 85 to 99%. Paralogs may
differ in their expression profiles, indicating that they may act at a different time, in a different
place, or at a different developmental stage, even when their function appears to be similar.
Differences in function among paralogs may suggest that paralogs encode polypeptides that
are "remodeled" during plant evolution, for example to create new forms of oxidized
carotenoids in tomato as described by Bouvier et al. (Eur J Biochem. 267:6346, 2000).

In one embodiment, paralogs may be isolated by hybridizing a stress-responsive
sequence probe to a Southern blot containing the appropriate genomic DNA or cDNA of the
organism. To search for paralogs within a species, low stringency hybridization is usually
performed, but the stringency used will depend on the size, distribution and degree of sequence
divergence of the domains that define the gene family. Given the resulting hybridization data,
one of ordinary
skill in the art could distinguish and isolate the correct DNA fragments by size, restriction
sites and stated hybridization conditions from a gel or from a library. Alternatively, paralogs
may be isolated by large-scale sequencing followed by BLAST analysis of sequences to
identify putative paralogs, as described by Ospina-Giraldo et al. (Fungal Genet Biol 29:81,
2000). In another embodiment, paralogs may be isolated using reverse-transcriptase
polymerase chain reaction (RT-PCR) using primers to conserved regions of sequence.
Paralogs may be cloned using standard techniques to screen libraries using at least one
stress-responsive sequence as a probe.

The stress-responsive sequences disclosed herein can also be used to determine
orthologous sequences. An orthologous sequence, or orthologous gene, or ortholog, has a
high degree of sequence similarity to a known sequence or gene of interest, with the
similarity often occurring along the entire length of the coding portion of the gene. The terms
"orthologous sequence" and "ortholog" as used herein encompass both full-length genes and
functional regions and fragments thereof. An ortholog often encodes a gene product that
performs a similar function in the organism. Functions for orthologous genes are expected to
be the same as or very similar to that of the gene from which the probe was prepared. The
degree of identity is a function of evolutionary separation and, in closely related species, the
degree of sequence identity can be 98 to 100%. Orthologous sequences sometimes have
significantly lower levels of sequence identity, for example as described by Weise et al.
(Plant Cell 12:1345, 2000) where orthologs of sucrose transporters from Arabidopsis,
tomato, and potato had 47% similarity to the previously characterized sucrose transporter.
The amino acid sequence of a protein encoded by an orthologous gene can be less than 50%
identical, but tends to be at least 50%, or at least 70% or at least 80% identical, or at least
90%, or at least 95% identical to the amino acid sequence of the reference protein.

To find orthologs, probes are hybridized to nucleotides from a species of interest
under low stringency conditions and blots are then washed under conditions of increasing
stringency. It is preferable that the wash stringency be such that sequences that are 85 to
100% identical will hybridize. More preferably, sequences 90 to 100% identical will
hybridize and most preferably only sequences greater than 95% identical will hybridize. The
low stringency condition is preferably one where sequences containing as much as 40-45%
mismatches will be able to hybridize. This condition is established at Tm - 40°C to Tm -
48°C. One of ordinary skill in the art will recognize that, due to degeneracy in the genetic
code, amino acid sequences that are identical can be encoded by DNA sequences as little as
67% identical. Thus, it is preferable to make an overlapping series of shorter probes, on the
order of 24 to 45 nucleotides, and individually hybridize them to the same arrayed library to
avoid the problem of degeneracy introducing large numbers of mismatches.
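
The effect of codon degeneracy on nucleotide identity can be illustrated with a minimal Python sketch. The two short coding sequences and the partial codon table below are hypothetical examples chosen for illustration and are not taken from the Sequence Listing; they encode the same tripeptide but differ at the third base of every codon, which gives roughly 67% nucleotide identity. The helper `tile_probes` is an assumed name showing one possible way to lay out an overlapping series of short probes; the 30-nucleotide length and 15-nucleotide offset are illustrative choices within the 24 to 45 nucleotide range.

```python
# Partial codon table: only the codons used in this illustration.
CODON_TABLE = {
    "GCT": "A", "GCA": "A",   # alanine
    "GGT": "G", "GGA": "G",   # glycine
    "CCA": "P", "CCT": "P",   # proline
}

def translate(dna):
    """Translate a coding sequence using the partial table above."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

def percent_identity(a, b):
    """Percent of aligned positions that match (equal length, no gaps)."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)

def tile_probes(seq, probe_len=30, step=15):
    """Return overlapping probes of probe_len bases, offset by step bases."""
    return [seq[i:i + probe_len]
            for i in range(0, len(seq) - probe_len + 1, step)]

seq_a = "GCTGGTCCA"  # Ala-Gly-Pro (hypothetical)
seq_b = "GCAGGACCT"  # Ala-Gly-Pro, third base of each codon changed

print(translate(seq_a), translate(seq_b))
print(round(percent_identity(seq_a, seq_b), 1))
print(tile_probes("ACGT" * 15))  # 60-nt hypothetical target
```

Because each short probe spans only a few codons, a probe that happens to match well can still hybridize even when the full-length sequences have diverged at many third-base positions.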

In one embodiment, orthologous sequences, or orthologs, may be isolated by
hybridizing a stress-responsive sequence probe to a Southern blot containing the appropriate
genomic DNA or cDNA of a different organism, for example, another cereal. Alternatively,
orthologs may be isolated by large-scale sequencing followed by BLAST analysis of
sequences to identify putative orthologous sequences and full-length orthologs. In another
embodiment, orthologs may be isolated by reverse-transcriptase polymerase chain reaction
(RT-PCR) using primers to conserved regions of sequence in a stress-responsive sequence.
Orthologs and/or orthologous sequences may be cloned using standard techniques to screen
libraries using at least one stress-responsive sequence as a probe.

As evolutionary divergence increases, genome sequences also tend to diverge. Thus,
one of skill will recognize that searches for orthologous genes between more divergent
species will require the use of lower stringency conditions compared to searches between
closely related species. Also, degeneracy is more of a problem for searches in the genome of
a species more distant evolutionarily from the species that is the source of the
stress-responsive probe sequences.



Identification of the relationship of nucleotide or amino acid sequences among plant
species can be done by comparison of the subject nucleotide or amino acid sequence with the
sequences of the present application presented in the Sequence Listing.

Sequences disclosed herein can also be used to isolate corresponding DNA by
Southern blotting. Probes for Southern blotting to distinguish individual restriction
fragments can range in size from 15 to 20 nucleotides to several thousand nucleotides.
Typically, the probe is 100 to 1000 nucleotides long for identifying members of a gene
family when it is found that repetitive sequences would complicate the hybridization. For
identifying an entire corresponding gene in another species, the probe is more often the
length of the gene, typically 2000 to 10,000 nucleotides, but probes 50 to 1,000 nucleotides
long might be used. Some genes, however, might require probes up to 15,000 nucleotides
long or overlapping probes constituting the full-length sequence to span their lengths.

In one embodiment, the probe derived from sequences of the present invention is
homogeneous, having a single sequence. In another embodiment, a probe designed to
represent or identify members of a gene family having diverse sequences can be generated
using PCR to amplify genomic DNA or RNA templates using primers derived from
stress-responsive sequences that include sequences that define the gene family.

For identifying corresponding genes in another species, the probe for Southern
blotting most preferably would be the genomic copy of the probe gene. This allows all
elements of the gene to be identified in the other species. The next most preferable probe is a
cDNA spanning the entire coding sequence, which allows the entire mRNA-coding portion of
the gene to be identified; in this case it is possible that some introns in the gene might be
missed. Probes for Southern blotting can easily be generated from stress-responsive
sequences by making primers having the sequence at the ends of the sequence and using rice
(Oryza sativa) genomic DNA as a template. In instances where the sequence includes
sequence conserved among species, primers including the conserved sequence can be used
for PCR with genomic DNA from a species of interest to obtain a probe. Similarly, if the
sequence includes a domain of interest, that portion of the sequence can be used to make
primers and, with appropriate template DNA, used to make a probe to identify genes
containing the domain. Alternatively, the PCR products can be resolved, for example by gel
electrophoresis, and cloned and/or sequenced. In this manner, the variants of the domain
among members of a gene family, both within and across species, can be examined.

The sequences of the invention can be used for library screening to isolate the
corresponding DNA from the same organism or other organisms. Either cDNA or genomic
DNA can be isolated. Lambda, cosmid, BAC, YAC, or other large insert genomic libraries
from the plant of interest can be constructed using standard
molecular biology techniques as described in detail by Sambrook et al. (Molecular Cloning:
A Laboratory Manual. Cold Spring Harbor Laboratory Press, 1989) and by Ausubel et al.
(Current Protocols in Molecular Biology. Greene Publishing, 1992, with updates).

To screen a phage library, recombinant lambda clones are plated out on appropriate
bacterial medium using an appropriate E. coli host strain. The resulting plaques are lifted
from the plates using nylon or nitrocellulose filters. The plaque lifts are processed through
denaturation, neutralization, and washing treatments following the standard protocols
outlined by Ausubel et al. (1992, supra). The plaque lifts are hybridized to either
radioactively labeled or non-radioactively labeled stress-responsive DNA at room
temperature for about 16 hours, usually in the presence of 50% formamide and 5X SSC
(sodium chloride and sodium citrate) buffer and blocking reagents. The plaque lifts are then
washed at 42°C with 1% sodium dodecyl sulfate (SDS) and a particular concentration of
SSC. The SSC concentration used is dependent upon the stringency at which hybridization
occurred in the initial Southern blot analysis performed. For example, if a fragment
hybridized under medium stringency such as Tm - 20°C, then this condition is maintained or
adjusted to a less stringent condition such as Tm - 30°C, to wash the plaque lifts. Positive
clones showing hybridization to the probe are detected by exposure to X-ray films or
chromogen formation or any other suitable detection method, and subsequently isolated for
purification using the same general protocol outlined above. Once the clone is purified,
restriction analysis can be conducted to narrow the region corresponding to the gene of
interest. Restriction analysis and succeeding subcloning steps can be done using procedures
described by, for example, Sambrook et al. (1989, supra).
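
The dependence of wash stringency on SSC concentration can be sketched numerically. The Python example below uses a commonly cited empirical approximation for the melting temperature of long DNA-DNA hybrids, Tm = 81.5 + 16.6 log10[Na+] + 0.41(%GC) - 0.61(% formamide) - 500/L; this formula is not given in the present text and is used here only as an assumption. The probe length, GC content, and the sodium concentrations taken to approximate 2X, 1X and 0.1X SSC are likewise hypothetical illustrative values.

```python
import math

def estimate_tm(gc_percent, length_bp, na_molar, formamide_percent=0.0):
    """Approximate Tm (deg C) of a long DNA-DNA hybrid (empirical formula)."""
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_bp)

WASH_TEMP_C = 42.0  # wash temperature stated in the protocol above

# Assumed approximate Na+ molarity for 2X, 1X and 0.1X SSC.
for label, na in (("2X SSC", 0.39), ("1X SSC", 0.195), ("0.1X SSC", 0.0195)):
    tm = estimate_tm(gc_percent=50.0, length_bp=500, na_molar=na)
    offset = tm - WASH_TEMP_C
    print(f"{label}: estimated Tm {tm:.1f} C, a 42 C wash is about Tm - {offset:.1f}")
```

Under these assumptions, lowering the SSC concentration lowers the estimated Tm, so a wash at a fixed 42°C falls closer to Tm and is therefore more stringent.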

To screen a YAC library, the procedures outlined for the lambda library are
essentially similar except the YAC clones are harbored in bacterial colonies. The YAC
clones are plated out at reasonable density on nitrocellulose or nylon filters supported by
appropriate bacterial medium in petri plates. Following the growth of the bacterial clones,
the filters are processed through the denaturation, neutralization, and washing steps following
the procedures of Ausubel et al. (1992, supra). The same hybridization procedures for
lambda library screening are followed.

To isolate cDNA, similar procedures using appropriately modified vectors are
employed. For instance, the library can be constructed in a lambda vector appropriate for
cloning cDNA such as λgt11. Alternatively, the cDNA library can be made in a plasmid
vector. cDNA for cloning can be prepared by any of the methods known in the art, but is
preferably prepared as described above. Preferably, a cDNA library will include a high
proportion of full-length clones.

Identification and isolation of alleles and paralogs within a species, and orthologs
from other species, is particularly desirable because of their potential use as a tool for crop
improvement, especially for quantitative traits such as resistance to abiotic stress. By
identifying and isolating numerous alleles for each locus from a single species or from
different species, transgenic plants having various combinations of alleles can be created and
the effects of the combinations measured. Once a more favorable allele combination
has been identified, crop improvement can be accomplished either through biotechnological
means or by traditional (conventional) breeding programs (Tanksley et al., Science
277:1063, 1997). In a similar manner, substitution of at least one paralogous or orthologous
sequence in at least one locus will introduce diversity at each substituted locus, and the
substituted sequences will contribute to a trait that is influenced by combined interactions of
the products of several genes residing at different loci in the genome. When favorable
combinations of substituted sequences and endogenous sequences at loci whose products
interact are identified, crop improvement can be accomplished through further
biotechnological manipulation or by traditional breeding programs, including sexual crossing
and also apomixis.

The results from hybridization of the sequences of the invention to Southern blots
containing DNA from another species can be used to generate restriction fragment maps for
the corresponding genomic regions. These maps provide additional information about the
relative positions of restriction sites within fragments, further distinguishing mapped DNA
from the remainder of the genome. Physical maps can be made by digesting genomic DNA
with different combinations of restriction enzymes.

Sequence analysis and mapping of related sequences (paralogs and orthologs) can be
used in phylogenetic analyses of the evolution of the sequences in question, including the
determination of gene duplication and rearrangements. In addition, expression studies of
related sequences can be used to further understand the evolutionary history and function of
the paralogs and orthologs, and to suggest future uses for the sequences.

Isolated polynucleotides within the scope of the invention also include allelic variants
of the specific sequences presented in the Sequence Listing. An "allelic variant" is a sequence
that is a variant from that of the stress-responsive sequence, but represents the same
chromosomal locus in the organism. Allelic variants can arise by normal genetic variation in
a population. Allelic variants can also be produced by genetic engineering methods, for
example by the method of chimeraplasty using chimeric oligonucleotides to introduce a
single nucleotide base substitution in a target sequence, as described by Beetham et al. (Proc.
Natl. Acad. Sci. USA 96:8774, 1999) and Zhu et al. (Proc. Natl. Acad. Sci. USA 96:8768,
1999). An allelic variant can be one that is found naturally occurring in a plant, including a
cultivar or ecotype. An allele can give rise to detectably distinct phenotypic and expression
profiles. An allelic variant may or may not give rise to a phenotypic change, and may or may
not be expressed. An expressed allele can result in a detectable change in the phenotype of
the trait represented by the locus. Allelic variations can occur in any portion of the gene
sequence, including regulatory regions as well as structural regions. The stress-responsive
sequences of the present invention are useful to detect and/or isolate allelic variants, and may
be used to introduce an allelic variant at a locus. Thus, the present sequences can be used to
manipulate the allelic diversity of a plant or a population.



With respect to nucleotide sequences, degeneracy of the genetic code provides the
possibility to substitute at least one base of the base sequence of a gene with a different base
without causing the amino acid sequence of the polypeptide produced from the gene to be
changed. Hence, the sequences of the present invention may also have any base sequence
that has been changed from a sequence as provided in the Sequence Listing by substitution in
accordance with degeneracy of the genetic code. References describing codon usage include
Carels et al., J Mol Evol 46:45, 1998 and Fennoy et al., Nucl Acids Res. 21:5294, 1993.

One embodiment of the invention comprises stress-responsive polypeptides
containing a universal stress protein A (USPA) domain and polynucleotides encoding said
polypeptides. The gene encoding proteins containing the USPA domain was originally found
in E. coli (Nystrom and Neidhardt, Mol. Microbiol. 6:3187-3198, 1992). The uspA gene is
unique in its almost universal responsiveness to diverse stresses. Rice stress-responsive
polypeptides containing a USPA domain are listed in Table 1.

Examples

The following examples are intended to provide illustrations of the application of the
present invention. The following examples are not intended to completely define or
otherwise limit the scope of the invention.

Example 1: Isolation and sequencing of DNA fragments
1.1 Isolation and sequencing of genomic DNA fragments

Genomic DNA was isolated from nuclei of Oryza sativa L. ssp. japonica cv.
Nipponbare and then sheared to produce fragments of approximately 500 bp. Using a
method derived from the method of Mao et al. (Genome Res 10:982 (2000)), seeds were
germinated on cheesecloth immersed in water and grown for 4-6 weeks under greenhouse
conditions. After plants reached a height of approximately 5-8 inches, the upper parts of the
green leaves were harvested and wrapped in aluminum foil at 4°C overnight. Leaf material
was then stored at -80°C or directly used for extraction of nuclei. Intact nuclei were isolated
by homogenization (in a blender for fresh material or by grinding with mortar and pestle for
frozen material) in a buffer containing 10 mM Trizma base, 80 mM KCl, 10 mM EDTA, 1
mM spermidine, 1 mM spermine, 0.5 M sucrose, 0.5% Triton X-100, 0.15%
β-mercaptoethanol, pH 9.5. The homogenate was filtered and nuclei recovered by gentle
centrifugation using a fixed-angle rotor at 1,800 g at 4°C for 20 minutes. The pellet
recovered after centrifugation was gently resuspended with the assistance of a small paint
brush soaked in ice-cold wash buffer, and wash buffer was added. Particulate matter remaining in
the suspension was removed by filtering the resuspended nuclei into a 50 ml centrifuge tube
through two layers of Miracloth by gravity and centrifuging the filtrate at 57 g (500 rpm), 4°C
for 2 minutes to remove intact cells and tissue residues. The supernatant fluid was
transferred into a fresh centrifuge tube and nuclei were pelleted by centrifugation at 1,800 g,
4°C for 15 minutes in a swinging bucket centrifuge.
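
The relationship between the stated centrifugal forces and rotor speed can be sketched with the standard conversion RCF = 1.118 x 10^-5 x r x rpm^2, where r is the rotor radius in centimeters. The rotor radius is not stated in the text; the value of 20.4 cm used below is an assumption, chosen only because it is consistent with the stated pairing of 57 g and 500 rpm.

```python
RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in rpm

def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) for a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2

def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (rpm) needed to reach a given relative centrifugal force."""
    return (rcf / (RCF_CONSTANT * radius_cm)) ** 0.5

ASSUMED_RADIUS_CM = 20.4  # hypothetical; not given in the protocol

print(f"500 rpm at {ASSUMED_RADIUS_CM} cm: {rcf_from_rpm(500, ASSUMED_RADIUS_CM):.0f} g")
print(f"1,800 g at {ASSUMED_RADIUS_CM} cm: {rpm_from_rcf(1800, ASSUMED_RADIUS_CM):.0f} rpm")
```

A different rotor radius would require a different speed to reach the same force, which is why the protocol is best followed by g value rather than by rpm.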

DNA was isolated from the nuclear preparation by phenol/chloroform extraction, as
in Sambrook et al. (supra). Isolated total genomic DNA was physically sheared (Hydroshear)
to generate random DNA fragments, and fragments of approximately 500 bp
were recovered. DNA was eluted and the ends filled in using T4 DNA polymerase, Klenow
fragment, and dNTPs. Double-stranded DNA was linkered and cloned into a proprietary
medium-copy vector derived from pSC101.

Vector inserts were amplified by PCR and sequenced using the MegaBACE
sequencing system (Molecular Dynamics, Amersham). The amplification reaction was
diluted before use and was not purified using an exonuclease/alkaline phosphatase procedure.
Sequencing reactions were performed using the DYEnamic ET Terminator Kit. The reactions
contained approximately 50 ng of amplicon, DYEnamic ET Terminator premix, and 5 pmol
of -40 M13 forward primer. The sequencing reaction was amplified for 30 cycles, and reaction
products were concentrated and purified using ethanol precipitation. The sample was
electrokinetically injected into the capillary at 3 kV for 45 sec and separated via
electrophoresis at 9 kV for 120 min.

1.2 Isolation and sequencing of cDNA fragments 

Construction of rice cDNA library. Total RNA was purified from rice plant tissue
using standard total RNA purification methods. PolyA+ RNA was isolated from the total
RNA using the Qiagen Oligotex mRNA purification system (Qiagen, Valencia, CA), and
cDNA was generated using cDNA synthesis reagents from Life Technologies (Rockville,
MD). First strand cDNA synthesis was catalyzed by reverse transcriptase using oligo dT
primers with a NotI restriction site. Second strand synthesis was catalyzed by DNA
polymerase. An oligonucleotide linker with a SalI restriction endonuclease site was attached
to the 5' end of the cDNAs using DNA ligase. The cDNAs were digested with NotI and SalI
restriction endonucleases and inserted into an E. coli-replicating plasmid harboring a
selectable marker. E. coli was transformed with the recombinant plasmids and grown on
selectable media. E. coli colonies were individually picked off the selectable media and
placed into storage plates.

Sequencing the rice cDNA library. The DNA sequence of the cDNA cloned into the
plasmid purified from an E. coli colony was determined using standard dideoxy sequencing
methods. Oligonucleotide primers respectively corresponding to plasmid DNA regions
upstream of the 5' end of the cDNA insert (Forward reaction) and downstream of the 3' end
of the cDNA insert (Reverse reaction) were used in the dideoxy sequencing reactions. If the
DNA sequences determined as a result of the Forward and Reverse reactions from the cDNA
overlapped, the two sequences could be merged into a contig using computerized analysis
software (Consed, University of Washington, Seattle), to assemble a full-length sequence of
the cDNA. In cases where the DNA sequences from the Forward and Reverse reactions from a
single clone did not overlap sufficiently to be assembled into a contig, such that a
region of unsequenced DNA remained between the DNA from the Forward and Reverse reactions,
the DNA sequence of the separating region was determined using one
of two dideoxy sequencing methods. In a "primer walking" approach, a primer specifically
corresponding to the 3' end of the DNA sequence determined from the Forward reaction was
used in a second dideoxy sequencing reaction. The primer walking procedure was repeated
until the DNA sequence that separated the original Forward and Reverse sequences was resolved and a
contig could be assembled. Alternatively, the clone harboring the cDNA was subjected to
transposon in vitro insertion dideoxy sequencing (Epicentre, Madison, WI). In this procedure,
the insertion process was random and the result was multiple DNA sequence coverage over
the targeted cDNA, where the sequences thus obtained were assembled into a contig.
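
The merging of overlapping Forward and Reverse reads into a contig can be illustrated with a minimal Python sketch. This is not the algorithm used by Consed, which accounts for base quality and sequencing errors; the sketch assumes error-free reads and exact matching. The insert sequence, read lengths, and the function name `merge_reads` are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def merge_reads(forward, reverse, min_overlap=10):
    """Merge a Forward read with a Reverse read taken from the other strand.

    Returns the merged sequence, or None when no exact overlap of at least
    min_overlap bases exists (the case that calls for primer walking).
    """
    rev = reverse_complement(reverse)
    longest = min(len(forward), len(rev))
    for k in range(longest, min_overlap - 1, -1):
        if forward[-k:] == rev[:k]:
            return forward + rev[k:]
    return None

# Hypothetical 37-bp insert and two reads derived from it.
insert = "ATGGCGTACCTGAAGTTCGACCGTAGCTAAGGCTTGA"
forward_read = insert[:25]                       # read from the 5' end
reverse_read = reverse_complement(insert[15:])   # read from the 3' end

contig = merge_reads(forward_read, reverse_read)
print(contig == insert)

# Reads that leave an unsequenced gap cannot be merged.
gapped = merge_reads(insert[:12], reverse_complement(insert[20:]))
print(gapped)
```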

Example 2: GeneChip® Standard Protocol
The standard protocol for using the GeneChip® to quantitatively measure plant gene
expression was carried out as outlined below:
Quantitation of total RNA

1. Total RNA from plant tissue was extracted and quantified using GeneQuant
1 OD260 = 40 µg RNA/ml; A260/A280 = 1.9 to about 2.1

2. Ran gel to check the integrity and purity of the extracted RNA
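
The quantitation step can be expressed as a short calculation. The Python sketch below applies the conversion of 40 µg RNA/ml per OD260 unit and the A260/A280 acceptance window of 1.9 to about 2.1 stated above; the absorbance readings and the 50-fold dilution factor are hypothetical example values.

```python
UG_PER_ML_PER_OD260 = 40.0  # 1 OD260 unit corresponds to 40 ug RNA/ml

def rna_concentration(a260, dilution_factor=1.0):
    """RNA concentration (ug/ml) of the undiluted sample."""
    return a260 * UG_PER_ML_PER_OD260 * dilution_factor

def purity_ok(a260, a280, low=1.9, high=2.1):
    """True when the A260/A280 ratio falls within the accepted window."""
    return low <= a260 / a280 <= high

# Hypothetical readings for a sample diluted 50-fold before measurement.
a260, a280 = 0.5, 0.25
print(rna_concentration(a260, dilution_factor=50))
print(purity_ok(a260, a280))
```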
Synthesis of double-stranded cDNA 

Gibco/BRL Superscript Choice System for cDNA Synthesis (Cat#18090-019) was
employed to prepare cDNAs. T7-(dT)24 oligonucleotides were prepared and purified by
HPLC. (5'-GGCCAGTGAATTGTAATACGACTCACTATAGGGAGGCGG-(dT)24-3';
SEQ ID NO: 17507).

Step 1. Primer hybridization:
Incubated at 70°C for 10 minutes
Spun quickly and put on ice briefly
Step 2. Temperature adjustment:
Incubated at 42°C for 2 minutes
Step 3. First strand synthesis carried out using:
DEPC-water - 1 µl
RNA (10 µg final) - 10 µl
T7-(dT)24 Primer (100 pmol final) - 1 µl
5X 1st strand cDNA buffer - 4 µl
0.1 M DTT (10 mM final) - 2 µl
10 mM dNTP mix (500 µM final) - 1 µl
Superscript II RT 200 U/µl - 1 µl
Total of 20 µl
Mixed well
Incubated at 42°C for 1 hour
Step 4. Second strand synthesis:
Placed reactions on ice, quick spin
DEPC-water - 91 µl
5X 2nd strand cDNA buffer - 30 µl
10 mM dNTP mix (250 µM final) - 3 µl
E. coli DNA ligase (10 U/µl) - 1 µl
E. coli DNA polymerase I (10 U/µl) - 4 µl
RNase H (2 U/µl) - 1 µl
T4 DNA polymerase (5 U/µl) - 2 µl
0.5 M EDTA (0.5 M final) - 10 µl
Total 162 µl
Mixed/spun down/incubated at 16°C for 2 hours
Step 5. Completing the reaction:
Incubated at 16°C for 5 minutes
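
The reaction volumes listed in Steps 3 and 4 can be cross-checked with a short calculation. The Python sketch below sums the component volumes and computes final concentrations by simple dilution; it assumes that the 20 µl first strand reaction is carried over in full into the second strand reaction, which is consistent with the stated total of 162 µl. The dictionary and function names are illustrative.

```python
first_strand_ul = {
    "DEPC-water": 1, "RNA": 10, "T7-(dT)24 primer": 1,
    "5X 1st strand buffer": 4, "0.1 M DTT": 2,
    "10 mM dNTP mix": 1, "Superscript II RT": 1,
}

second_strand_additions_ul = {
    "DEPC-water": 91, "5X 2nd strand buffer": 30, "10 mM dNTP mix": 3,
    "E. coli DNA ligase": 1, "E. coli DNA polymerase I": 4, "RNase H": 1,
    "T4 DNA polymerase": 2, "0.5 M EDTA": 10,
}

def total_volume(mix):
    """Sum of component volumes in microliters."""
    return sum(mix.values())

def final_concentration(stock, added_ul, total_ul):
    """Concentration after dilution, in the same units as the stock."""
    return stock * added_ul / total_ul

first_total = total_volume(first_strand_ul)
second_total = first_total + total_volume(second_strand_additions_ul)

print(first_total, second_total)
print(final_concentration(100.0, 2, first_total))  # DTT in mM (stock 100 mM)
print(final_concentration(10.0, 1, first_total))   # dNTP in mM (stock 10 mM)
print(final_concentration(5.0, 4, first_total))    # buffer in X (stock 5X)
```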
Purification of double-stranded cDNA

1. Centrifuged PLG (Phase Lock Gel, Eppendorf 5 Prime Inc., p1-188233) at 14,000X,
transferred 162 µl of cDNA to PLG

2. Added 162 µl of Phenol:Chloroform:Isoamyl alcohol (pH 8.0), centrifuged 2 minutes

3. Transferred the supernatant to a fresh 1.5 ml tube, added
Glycogen (5 mg/ml) 2 µl
0.5 M NH4OAc (0.75 x Vol) 120 µl
EtOH (2.5 x Vol, -20°C) 400 µl

4. Mixed well and centrifuged at 14,000X for 20 minutes

5. Removed supernatant, added 0.5 ml 80% EtOH (-20°C)

6. Centrifuged for 5 minutes, air dried or dried by speed vac for 5-10 minutes

7. Added 44 µl DEPC H2O
Analyzed quantity and size distribution of cDNA

Ran a gel using 1:1 ratio of the double-stranded synthesis product to loading buffer
Synthesis of biotinylated cRNA
(used Enzo BioArray High Yield RNA Transcript Labeling Kit
Cat#900182)

Purified cDNA 22 µl
10X Hy buffer 4 µl
10X biotin ribonucleotides 4 µl
10X DTT 4 µl
10X RNase inhibitor mix 4 µl
20X T7 RNA polymerase 2 µl
Total 40 µl

Centrifuged 5 seconds, and incubated for 4 hours at 37°C
Gently mixed every 30-45 minutes
Purification and quantification of cRNA

(used Qiagen RNeasy Mini kit Cat# 74103)

cRNA 40 µl
DEPC H2O 60 µl
RLT buffer 350 µl (mix by vortexing)
EtOH 250 µl (mix by pipetting)
Total 700 µl

Waited 1 minute or more for the RNA to stick
Centrifuged at 2000 rpm for 5 minutes
RPE buffer 500 µl
Centrifuged at 10,000 rpm for 1 minute
RPE buffer 500 µl
Centrifuged at 10,000 rpm for 1 minute
Centrifuged at 10,000 rpm for 1 minute to dry the column
DEPC H2O 30 µl
Waited for 1 minute, then eluted cRNA by centrifugation at 10K for 1 minute
DEPC H2O 30 µl
Repeated previous step
Determined concentration and diluted to 1 µg/µl concentration
Fragmentation of cRNA

cRNA (1 µg/µl) 15 µl
5X Fragmentation Buffer* 6 µl
DEPC H2O 9 µl
Total 30 µl

*5X Fragmentation Buffer
1 M Tris (pH 8.1) 4.0 ml
MgOAc 0.64 g
KOAc 0.98 g
DEPC H2O to final volume
Total 20 ml
Filter Sterilize
Array washed and stained in:
Stringent Wash Buffer**
Non-Stringent Wash Buffer***
SAPE Stain****
Antibody Stain*****
Washed on fluidics station using the appropriate antibody amplification protocol

**Stringent Buffer: 12X MES 83.3 ml, 5 M NaCl 5.2 ml, 10% Tween 1.0 ml,
H2O 910 ml, Filter Sterilize
***Non-Stringent Buffer: 20X SSPE 300 ml, 10% Tween 1.0 ml, H2O 698 ml,
Filter Sterilize, Antifoam 1.0
****SAPE Stain: 2X Stain Buffer 600 µl, BSA 48 µl, SAPE 12 µl, H2O 540 µl
*****Antibody Stain: 2X Stain Buffer 300 µl, H2O 266.4 µl, BSA 24 µl, Goat IgG 6 µl,
Biotinylated Ab 3.6 µl

Example 3: Profiling of plant stress-regulated genes

A GeneChip® Rice Genome Array (Affymetrix, Santa Clara, CA) was used to
identify clusters of genes that were coordinately induced in response to various stress
conditions. The GeneChip® Rice Genome Array contains probes synthesized in situ and is
designed to measure temporal and spatial gene expression of approximately 18,000 genes,
which covers approximately 40-50% of the genome.

The Affymetrix GeneChip® array was used to define nucleotide sequences/pathways
affected by various abiotic stresses, to define which are uniquely regulated by one stress
and which respond to multiple stresses, and to identify candidate nucleotide sequences for
screening for insertional mutants. Of the approximately 18,000 nucleotide sequences
represented on the Affymetrix GeneChip® array, certain nucleotide sequences showed at least
a 2-fold change in expression in at least one sample, relative to no-treatment controls.

The following describes in more detail how the experiments were done. Transcriptional
profiling was performed by hybridizing fluorescence-labeled cRNA with the oligonucleotide
probes on the chip, washing, and scanning. Each gene is represented on the chip by about
sixteen oligonucleotides (25-mers). Expression level is related to fluorescence intensity.
Starting material contained 1 to 10 µg total RNA; detection specificity was about 1:10^6;
approximately a 2-fold change was detectable, with less than 2% false positives; the dynamic
range was approximately 500x. Nucleotide sequences having up to 70% to 80% identity could
be discriminated using this system.

3.1 Growth conditions

Rice plants were grown for 6 weeks in convirons in plastic pots filled with sand. The
conditions of the conviron were 12 h/12 h light/dark, 25°C, ~50% RH and light intensity at 300
µE. The plants were fertilized three times per week with one-half-strength Hoagland
Solution containing 25 nM KH2PO4.

3.2 Abiotic stress treatment

Six weeks after placing the rice plants in convirons, stresses were applied as follows:
Control = no treatment;
Drought = 25% polyethylene glycol (PEG) 8000 (PEG is a more controllable method for creating a water deficit; the osmotic pressure from PEG will mimic the water deficit experienced during drought)
Osmotic Stress = 260.0 mM mannitol (equivalent osmolarity of a 150.0 mM NaCl solution)
NaCl = 150.0 mM
Cold = 14°C (the temperature at which pollen mother cell development is affected)

The abiotic stress treatments were applied at time 0 and then at the same time of day
on subsequent days (i.e., time 0, 24, 48 and 72 hours).

3.3 Tissue sampling

After the onset of treatment, 3 time points were harvested, namely, 3 hr, 27 hr and 75
hr. Leaves and roots were harvested separately and the tissue flash-frozen in liquid nitrogen.
These time points were set to be the exact same time of day at all 3 time points to eliminate the
effects of circadian rhythms in gene expression. RNA was purified, and the samples were
analyzed using the GeneChip® Rice Genome Array (Affymetrix, Santa Clara, CA) following the
manufacturer's protocol.



3.4 Data analysis

Raw fluorescence values as generated by Affymetrix software were processed as
follows: the values were brought into Microsoft Excel® and values of 25 or less were set to 25
(an empirically determined baseline as disclosed in Zhu and Wang, Plant Physiol 124:1472-1476,
2000). The values from the stressed samples were then converted to fold change relative
to control by dividing the values from the stressed samples by the values from the no-treatment 
control samples. Expression patterns that were altered at least 2-fold with respect to the control 
were selected. This method gave very robust results and resulted in a larger number of 
nucleotide sequences called as stress-regulated than previous methods had permitted. 
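
The data analysis steps described above can be sketched in Python as follows. The baseline of 25 and the 2-fold threshold are taken from the text; the probe names and fluorescence values are hypothetical, and treating "altered at least 2-fold" as covering both increases (fold change of 2 or more) and decreases (fold change of 0.5 or less) is an interpretation made for this sketch.

```python
BASELINE = 25.0  # values of 25 or less are set to 25

def apply_baseline(value):
    """Raise any raw fluorescence value below the baseline to the baseline."""
    return max(value, BASELINE)

def fold_change(stressed, control):
    """Fold change of a stressed sample relative to the no-treatment control."""
    return apply_baseline(stressed) / apply_baseline(control)

def is_stress_regulated(stressed, control, threshold=2.0):
    """True when expression is altered at least threshold-fold, up or down."""
    fc = fold_change(stressed, control)
    return fc >= threshold or fc <= 1.0 / threshold

# Hypothetical raw values: probe -> (stressed, control)
raw = {
    "probe_A": (400.0, 100.0),  # induced
    "probe_B": (10.0, 20.0),    # both below baseline
    "probe_C": (30.0, 120.0),   # repressed
    "probe_D": (150.0, 100.0),  # changed less than 2-fold
}

selected = [name for name, (s, c) in raw.items() if is_stress_regulated(s, c)]
print(selected)
```

Setting low values to the baseline before dividing prevents small, noisy readings near background from producing large apparent fold changes, as with probe_B in the example.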

Based on the profiles obtained following hybridization of nucleic acid molecules 
obtained from plant cells exposed to various stress conditions to the probes in the microarray, 
clusters of nucleotide sequences that were altered at least two-fold in response to the stress 
conditions were identified. 

Example 4: Identification of Abiotic Stress Responsive Genes by Yeast Two Hybrid System
An automated, high-throughput yeast two hybrid assay technology provided by Myriad
Genetics Inc. (Salt Lake City, UT) was used to search for protein interactions with a bait protein
known to be inducible by chilling in rice.

Multiple protein fragments that contain recognizable motifs, or domains, were constructed
as baits from the ORF encoding the protein to be studied. A screening protocol, which uses
Myriad's proprietary strains and vectors, was then used to search the individual baits against two
activation domain libraries of greater than five million cDNA clones of assorted peptide motifs.
The libraries were derived from RNA isolated from leaves, stems and roots of rice plants grown
in normal conditions plus tissues from plants exposed to various stresses (input trait library) and
from various seed stages, callus, and early and late panicle (output trait library). Both hybrid
proteins were expressed in a yeast reporter strain where an interaction between the test proteins
results in transcription of the reporter genes TRP1 and LEU2, allowing growth on selective
medium lacking tryptophan and leucine. Positives obtained from these searches were isolated
and their identity was determined by sequence analysis against proprietary and public nucleic
acid and protein databases.
To further characterize the polynucleotides encoding interacting proteins, the sequences
of the baits and preys were compared with the gene fragments represented on a proprietary
GeneChip® Rice Genome Array (Affymetrix, Santa Clara, CA) and where a polynucleotide was
identified on the chip, its expression was experimentally determined. Experiments included
evaluating differential gene expression from various plant tissues comprising seed, root, leaf and
stem, panicle, and pollen.

Using these methods, an ORF (SEQ ID NO: 17505) was identified encoding a protein 
that interacted with the chilling inducible bait. This ORF is contained in a full-length cDNA 
(SEQ ID NO: 17506) which also includes 258 bp of 5' UTR and 423 bp of 3' UTR. 

Example 5: Rice Orthologs of Arabidopsis Abiotic Stress Genes Identified by Reverse 
Genetics 

Understanding the function of every gene is the major challenge in the age of 
completely sequenced eukaryotic genomes. Sequence homology can be helpful in 
identifying possible functions of many genes. However, reverse genetics, the process of 
identifying the function of a gene by obtaining and studying the phenotype of an individual 
containing a mutation in that gene, is another approach to identify the function of a gene. 

Reverse genetics in Arabidopsis has been aided by the establishment of large publicly 
available collections of insertion mutants (Krysan et al., Plant Cell, 11:2283-2290, 1999; 
Tissier et al., Plant Cell, 11:1841-1852, 1999; Speulman et al., Plant Cell, 11:1853-1866, 1999; 
Parinov et al., Plant Cell, 11:2263-2270, 1999; Parinov and Sundaresan, Biotechnology, 
11:157-161, 2000). Mutations in genes of interest are identified by PCR screening of large 
pools of individuals, using primers derived from sequences near the insert border and from 
the gene of interest. Pools producing PCR products 
are confirmed by Southern hybridization and further deconvoluted into subpools until the 
individual is identified (Sussman et al., Plant Physiology, 124:1465-1467, 2000). 

Recently, some groups have begun the process of sequencing insertion site flanking 
regions from individual plants in large insertion mutant populations, in effect prescreening a 
subset of lines for genomic insertion sites (Parinov et al., Plant Cell, 11:2263-2270, 1999; 
Tissier et al., Plant Cell, 11:1841-1852, 1999). The advantage to this approach is that the 
laborious and time-consuming process of PCR-based screening and deconvolution of pools is 
avoided. 

A large database of insertion site flanking sequences from approximately 100,000 T-DNA 
mutagenized Arabidopsis plants of the Columbia ecotype (GARLIC lines) is prepared. 
T-DNA left border sequences from individual plants are amplified using a modified thermal 
asymmetric interlaced-polymerase chain reaction (TAIL-PCR) protocol (Liu et al., Plant J., 
8:457-463, 1995). Left border TAIL-PCR products are sequenced and assembled into a 
database that associates sequence tags with each of the approximately 100,000 plants in the 
mutant collection. Screening the collection for insertions in genes of interest involves a 
simple gene name or sequence BLAST query of the insertion site flanking sequence database, 
and search results point to individual lines. Insertions are confirmed using PCR. 

Analysis of the GARLIC insert lines suggests that there are 76,856 insertions that 
localize to a subset of the genome representing coding regions and promoters of 22,880 
genes. Of these, 49,231 insertions lie in the promoters of over 18,572 genes, and an 
additional 27,625 insertions are located within the coding regions of 13,612 genes. 
Approximately 25,000 T-DNA left border mTAIL-PCR products (25% of the total 102,765) 
do not have significant matches to the subset of the genome representing promoters and 
coding regions, and are therefore presumed to lie in noncoding and/or repetitive regions of 
the genome. 
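
By way of illustration only, the insertion counts reported above can be cross-checked with simple arithmetic. The following Python sketch uses only the figures stated in the text; the variable names are illustrative and form no part of the described method.

```python
# Figures taken from the text above; variable names are illustrative only.
total_products = 102_765       # T-DNA left border mTAIL-PCR products
promoter_insertions = 49_231   # insertions located in promoters
coding_insertions = 27_625     # insertions located in coding regions

# Insertions that localize to promoters or coding regions.
localized = promoter_insertions + coding_insertions

# Products with no significant match to promoters or coding regions.
unmatched = total_products - localized
unmatched_fraction = unmatched / total_products

print(localized)                      # 76856
print(unmatched)                      # 25909
print(round(unmatched_fraction, 3))   # 0.252
```

The sum reproduces the 76,856 localized insertions, and the remainder is consistent with the stated figure of approximately 25% of all products.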

The Arabidopsis T-DNA GARLIC insertion collection is used to investigate the roles 
of certain genes in abiotic stress. Target genes are chosen using a variety of criteria, 
including public reports of mutant phenotypes, RNA profiling experiments, and sequence 
similarity to genes implicated in abiotic stress. Plant lines with insertions in genes of interest 
are then identified. Each T-DNA insertion line is represented by a seed lot collected from a 
plant that is hemizygous for a particular T-DNA insertion. Plants homozygous for insertions 
of interest are identified using a PCR assay. The seed produced by these plants is 
homozygous for the T-DNA insertion mutation of interest. 
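
As a rough planning aid, and assuming (an assumption not stated in the text) that a single T-DNA insertion segregates 1:2:1 among the selfed progeny of a hemizygous plant, about one quarter of the progeny are expected to be homozygous for the insertion. The Python sketch below estimates how many progeny would need to be genotyped by PCR to recover at least one homozygote with a chosen probability. The function name and default values are hypothetical.

```python
import math

def plants_to_screen(confidence=0.95, homozygous_fraction=0.25):
    """Smallest number of progeny that yields at least one homozygote
    with probability >= confidence (independent plants assumed)."""
    # P(no homozygote among n plants) = (1 - f) ** n, so solve for n.
    n = math.log(1.0 - confidence) / math.log(1.0 - homozygous_fraction)
    return math.ceil(n)

print(plants_to_screen(0.95))   # 11
print(plants_to_screen(0.99))   # 17
```

Reduced transmission of the insertion or multiple linked insertions would change the expected fraction, so the numbers are only a starting point.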

Homozygous mutant plants are tested for altered stress response. Where an altered 
response is observed, the gene interrupted in the mutant contributes to the observed 
phenotype; that is, its disruption interferes with the normal response of the plant to 
abiotic stresses. 

Rice orthologs of the Arabidopsis genes affecting the plant's response to an abiotic 
stress are identified by similarity searching of a rice database using the Double-Affine 
Smith-Waterman algorithm (BLASTP with e values better than ~ 10 ). 

Example 6: Cloning and Sequencing of Nucleic Acid Molecules from Rice 

6.1 Genomic DNA: Plant genomic DNA samples are isolated from a collection of 
tissues. Individual tissues are collected from a minimum of five plants and pooled. DNA can 
be isolated according to one of three procedures, e.g., standard procedures described by 
Ausubel et al. (1995), a quick leaf prep described by Klimyuk et al. (Plant J., 3:493-494, 
1993), or using FTA paper (Life Technologies, Rockville, MD). 

For the latter procedure, a piece of plant tissue such as, for example, leaf tissue is 
excised from the plant, placed on top of the FTA paper and covered with a small piece of 
parafilm that serves as a barrier material to prevent contamination of the crushing device. In 
order to drive the sap and cells from the plant tissue into the FTA paper matrix for effective 
cell lysis and nucleic acid entrapment, a crushing device is used to mash the tissue into the 
FTA paper. The FTA paper is air dried for an hour. For analysis of DNA, the samples can 
be archived on the paper until analysis. Two mm punches are removed from the specimen 
area on the FTA paper using a 2 mm Harris Micro Punch™ and placed into PCR tubes. Two 
hundred (200) microliters of FTA purification reagent is added to the tube containing the 
punch and vortexed at low speed for 2 seconds. The tube is then incubated at room 
temperature for 5 minutes. The solution is removed with a pipette and the wash is repeated 
one more time. Two hundred (200) microliters of TE (10 mM Tris, 0.1 mM EDTA, pH 8.0) 
is added and the wash is repeated two more times. The PCR mix is added directly to the 
punch for subsequent PCR reactions. 

6.2 Cloning of Candidate cDNA: A candidate cDNA is amplified from total RNA 
isolated from rice tissue after reverse transcription using primers designed against the 
computationally predicted cDNA. Primers designed based on the genomic sequence can be 
used to PCR amplify the full-length cDNA (start to stop codon) from first strand cDNA 
prepared from rice cultivar Nipponbare tissue. 
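
A minimal sketch of how start-to-stop primers might be derived from a predicted coding sequence is given below in Python. The toy sequence, the primer length and the function names are hypothetical; practical primer design would also weigh melting temperature, secondary structure and specificity, none of which is modeled here.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def full_length_primers(cds, length=20):
    """Return (forward, reverse) primers, both written 5'->3'."""
    forward = cds[:length]                        # begins at the start codon
    reverse = reverse_complement(cds[-length:])   # anneals across the stop codon
    return forward, reverse

# Made-up coding sequence used only to exercise the functions.
toy_cds = "ATGGCTTCTAAGGAACTTGGTCCAGCTGATTACAAGGCTTAA"
fwd, rev = full_length_primers(toy_cds, length=12)
print(fwd)   # ATGGCTTCTAAG
print(rev)   # TTAAGCCTTGTA
```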

The Qiagen RNeasy kit (Qiagen, Hilden, Germany) is used for extraction of total RNA. 
The Superscript II kit (Invitrogen, Carlsbad, USA) is used for the reverse transcription 
reaction. PCR amplification of the candidate cDNA is carried out using the reverse primer 
sequence located at the translation start of the candidate gene in the 5' to 3' direction. This is 
performed with high-fidelity Taq polymerase (Invitrogen, Carlsbad, USA). 

The PCR fragment is then cloned into pCR2.1-TOPO (Invitrogen) or the pGEM-T easy 
vector (Promega Corporation, Madison, Wis.) per the manufacturer's instructions, and 
several individual clones are subjected to sequencing analysis. 

6.3 DNA sequencing: DNA from 2-4 independent clones is miniprepped 
following the manufacturer's instructions (Qiagen). DNA is subjected to sequencing analysis 
using the BigDye™ Terminator Kit according to manufacturer's instructions (Applied 
Biosystems Inc., Foster City, CA). Sequencing makes use of primers designed to both 
strands of the predicted gene of interest. DNA sequencing is performed using standard 
dye-terminator sequencing procedures and automated sequencers (models 373 and 377; Applied 
Biosystems). All sequencing data are analyzed and assembled using the Phred/Phrap/Consed 
software package (University of Washington) to an error ratio equal to or less than 10^-4 at the 
consensus sequence level. 

The consensus sequence from the sequencing analysis is then validated as being 
intact and as the correct gene in several ways. The coding region is checked for being full 
length (predicted start and stop codons present) and uninterrupted (no internal stop codons). 
Alignment with the gene prediction and BLAST analysis is used to ascertain that this is in 
fact the right gene. 

The clones are sequenced to verify their correct amplification. 
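
The full-length and uninterrupted checks described above can be sketched, purely for illustration, as follows. The function name and the messages it returns are hypothetical; the check assumes the standard stop codons TAA, TAG and TGA and an ATG start codon, and it does not perform the alignment or BLAST comparison.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def check_coding_region(seq):
    """Return a list of problems found; an empty list means all checks passed."""
    seq = seq.upper()
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of three")
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    if not codons or codons[0] != "ATG":
        problems.append("no start codon at position 1")
    if not codons or codons[-1] not in STOP_CODONS:
        problems.append("no stop codon at the end")
    if any(codon in STOP_CODONS for codon in codons[1:-1]):
        problems.append("internal stop codon")
    return problems

print(check_coding_region("ATGGCTTAA"))      # []
print(check_coding_region("ATGTAAGCTTAA"))   # ['internal stop codon']
print(check_coding_region("ATGGCTGCT"))      # ['no stop codon at the end']
```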

Example 7: Functional analysis in plants 

A plant complementation assay can be used for the functional characterization of the 
abiotic stress genes according to the invention. 

Rice and Arabidopsis putative orthologue pairs are identified using BLAST 
comparisons, TFASTXY comparisons, and Double-Affine Smith-Waterman similarity 
searches. Constructs containing a rice cDNA or genomic clone inserted between the promoter 
and terminator of the Arabidopsis orthologue are generated using overlap PCR (Horton et al., 
Gene, 77:61-68, 1989) and GATEWAY cloning (Life Technologies Invitrogen, Carlsbad, 
CA). For ease of cloning, rice cDNA clones are preferred to rice genomic clones. A 
three-stage PCR strategy is used to make these constructs. 

(1) In the first stage, primers are used to PCR amplify: (i) 2 kb upstream of the 
translation start site of the Arabidopsis orthologue, (ii) the coding region or cDNA of the rice 
orthologue, and (iii) the 500 bp immediately downstream of the Arabidopsis orthologue's 
translation stop site. Primers are designed to incorporate onto their 5' ends at least 16 bases 
of the 3' end of the adjacent fragment, except in the case of the most distal primers which 
flank the gene construct (the forward primer of the promoter and the reverse primer of the 
terminator). The forward primer of the promoter contains on its 5' end a partial attB1 
site, and the reverse primer of the terminator contains on its 5' end a partial attB2 site, 
for Gateway cloning. 

(2) In the second stage, overlap PCR is used to join either the promoter and the 
coding region, or the coding region and the terminator. 

(3) In the third stage, either the promoter-coding region product can be joined to 
the terminator or the coding region-terminator product can be joined to the promoter, using 
overlap PCR and amplification with full Att site-containing primers, to link all three 
fragments, and put full Att sites at the construct termini. 
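
The tailed junction primers used in the first stage can be sketched as follows. This Python example is illustrative only: the sequences are made up, the 18-base annealing length is an assumed value, and the function names are hypothetical. Only the 16-base overlap is taken from the text.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def junction_primers(upstream, downstream, overlap=16, anneal=18):
    """Primers spanning the junction between two fragments to be fused.

    Returns (reverse primer for `upstream`, forward primer for `downstream`),
    both written 5'->3'. Each carries a 5' tail of `overlap` bases derived
    from the adjacent fragment.
    """
    # Forward primer of the downstream fragment: 5' tail = 3' end of upstream.
    forward = upstream[-overlap:] + downstream[:anneal]
    # Reverse primer of the upstream fragment: 5' tail = start of downstream,
    # written on the opposite strand.
    reverse = reverse_complement(upstream[-anneal:] + downstream[:overlap])
    return reverse, forward

# Made-up 30-base stand-ins for the promoter end and the coding region start.
promoter_end = "GATTACAGATTACAGATTACAGGCCTTAAC"
coding_start = "ATGGCTTCTAAGGAACTTGGTCCAGCTGAT"
rev, fwd = junction_primers(promoter_end, coding_start)
print(fwd)
print(rev)
```

Because each product then ends in sequence shared with its neighbor, the two products can prime on one another in the overlap PCR of the second and third stages.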

The fused three-fragment piece flanked by Gateway cloning sites is introduced into 
the LTI donor vector pDONR201 using the BP clonase reaction, for confirmation by 
sequencing. Confirmed sequenced constructs are introduced, using the LR clonase reaction, 
into a binary vector containing Gateway cloning sites such as, for example, pAS200. 

The pAS200 vector was created by inserting the Gateway cloning cassette RfA into the 
Acc65I site of pNOV3510. 

pNOV3510 was created by ligation of inverted pNOV2114 VS1 binary into 
pCTK7-PTX5'AtPPONOS. 

pNOV2114 was created by insertion of virGN54D (Pazour et al., J. Bacteriol., 
174:4169-4174, 1992) from pAD1289 (Hansen et al., Proc. Natl. Acad. Sci. USA, 
91:7603-7607, 1994) into pHiNK085. 

pHiNK085 was created by deleting the 35S:PMI cassette and M13 ori in pVictorHiNK. 

pVictorHiNK was created by modifying the T-DNA of pVictor (described in WO 
97/04112) to delete M13 derived sequences and to improve its cloning versatility by 
introducing the BIGLINK polylinker. 

The sequence of the pVictorHiNK vector is disclosed in SEQ ID NO: 5 of WO 
00/6837, which is incorporated herein by reference. The pVictorHiNK vector contains the 
following constituents that are of functional importance: 

- The origin of replication (ORI) functional in Agrobacterium is derived from the 
Pseudomonas aeruginosa plasmid pVS1 (Itoh et al., Plasmid, 11:206-220, 1984; Itoh 
and Haas, Gene, 36:27-36, 1985). The pVS1 ORI is only functional in Agrobacterium 
and can be mobilized by the helper plasmid pRK2013 from E. coli into A. tumefaciens 
by means of a triparental mating procedure (Ditta et al., Proc. Natl. Acad. Sci. USA, 
77:7347-7351, 1980). 

- The ColE1 origin of replication functional in E. coli is derived from pUC19 
(Yanisch-Perron et al., Gene, 33:103-119, 1985). 

- The bacterial resistance to spectinomycin and streptomycin encoded by a 0.93 kb 
fragment from transposon Tn7 (Fling et al., Nucl. Acids Res., 13:7095, 1985) functions 
as selectable marker for maintenance of the vector in E. coli and Agrobacterium. The 
gene is fused to the tac promoter for efficient bacterial expression (Amann et al., Gene, 
25:167-178, 1983). 

- The right and left T-DNA border fragments of 1.9 kb and 0.9 kb, which comprise 
the 24 bp border repeats, have been derived from the Ti-plasmid of the nopaline type 
Agrobacterium tumefaciens strains pTiT37 (Yadav et al., Proc. Natl. Acad. Sci. USA, 
79:6322-6326, 1982). 

The plasmid is introduced into Agrobacterium tumefaciens GV3101pMP90 by 
electroporation. The positive bacterial transformants are selected on LB medium 
containing 50 µg/µl kanamycin and 25 µg/µl gentamycin. Plants are transformed by 
standard methodology (e.g., by dipping flowers into a solution containing the 
Agrobacterium) except that 0.02% Silwet L-77 (Lehle Seeds, Round Rock, TX) is added 
to the bacterial suspension and the vacuum step omitted. Five hundred (500) mg of seeds 
are planted per 2 ft² flat of soil and plant transformants are selected by spraying with the 
formulated herbicide BASTA (2 ml of Finale, AgrEvo Environmental Health, Montvale, 
NJ, is added to 498 ml water) once every two days, for a week. 



Example 8: Vector construction for overexpression and gene "knockout" 
experiments 

8.1 Overexpression 

Vectors used for expression of full-length "abiotic stress candidate genes" of interest in 
plants (overexpression) are designed to overexpress the protein of interest and are of two 
general types, biolistic and binary, depending on the plant transformation method to be used. 

For biolistic transformation (biolistic vectors), the requirements are as follows: 

1. a backbone with a bacterial selectable marker (typically, an antibiotic resistance gene) 
and origin of replication functional in Escherichia coli (E. coli; e.g. ColE1), and 

2. a plant-specific portion consisting of: 

a. a gene expression cassette consisting of a promoter (e.g. ZmUBIint MOD), the 
gene of interest (typically, a full-length cDNA) and a transcriptional 
terminator (e.g. Agrobacterium tumefaciens nos terminator); 
b. a plant selectable marker cassette, consisting of a promoter (e.g. rice Act1D-BV 
MOD), selectable marker gene (e.g. phosphomannose isomerase, PMI) 
and transcriptional terminator (e.g. CaMV terminator). 

Vectors designed for transformation by Agrobacterium tumefaciens (A. tumefaciens; binary 
vectors) consist of: 

1. a backbone with a bacterial selectable marker functional in both E. coli and A. 
tumefaciens (e.g. spectinomycin resistance mediated by the aadA gene) and two 
origins of replication, functional in each of aforementioned bacterial hosts, plus the A. 
tumefaciens virG gene; 

2. a plant-specific portion as described for biolistic vectors above, except in this instance 
this portion is flanked by A. tumefaciens right and left border sequences which 
mediate transfer of the DNA flanked by these two sequences to the plant. 
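
The two sets of requirements listed above can be summarized, for illustration only, as a checklist in Python. The class, its field names and the two functions are hypothetical bookkeeping devices and do not correspond to any vector file format or software tool.

```python
from dataclasses import dataclass, field

@dataclass
class VectorSpec:
    """Hypothetical checklist of vector features; not a real vector format."""
    bacterial_marker_hosts: set = field(default_factory=set)
    replication_origin_hosts: set = field(default_factory=set)
    has_expression_cassette: bool = False
    has_plant_marker_cassette: bool = False
    has_t_dna_borders: bool = False
    has_virG: bool = False

def meets_biolistic_requirements(v):
    """Backbone functional in E. coli plus both plant cassettes."""
    return ("E. coli" in v.bacterial_marker_hosts
            and "E. coli" in v.replication_origin_hosts
            and v.has_expression_cassette
            and v.has_plant_marker_cassette)

def meets_binary_requirements(v):
    """Backbone functional in both hosts, virG, and border-flanked cassettes."""
    both = {"E. coli", "A. tumefaciens"}
    return (both <= v.bacterial_marker_hosts
            and both <= v.replication_origin_hosts
            and v.has_expression_cassette
            and v.has_plant_marker_cassette
            and v.has_t_dna_borders
            and v.has_virG)

biolistic = VectorSpec({"E. coli"}, {"E. coli"}, True, True)
binary = VectorSpec({"E. coli", "A. tumefaciens"},
                    {"E. coli", "A. tumefaciens"}, True, True, True, True)
print(meets_biolistic_requirements(biolistic), meets_binary_requirements(biolistic))
print(meets_biolistic_requirements(binary), meets_binary_requirements(binary))
```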

8.2 Knock out vectors 

Vectors designed for reducing or abolishing expression of a single gene or of a family 
of related genes (knockout vectors) are also of two general types corresponding to the 
methodology used to downregulate gene expression: antisense or double-stranded RNA 
interference (dsRNAi). 

(a) Antisense 

For antisense vectors, a full-length or partial gene fragment (typically, a portion of the 
cDNA) can be used in the same vectors described for full-length expression, as part of the 
gene expression cassette. For antisense-mediated down-regulation of gene expression, the 
coding region of the gene or gene fragment will be in the opposite orientation relative to the 
promoter; thus, mRNA will be made from the non-coding (antisense) strand in planta. 
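
The effect of reversing the insert orientation can be illustrated with a short Python sketch. The fragment is a made-up sequence and the function names are hypothetical; the model simply treats the transcript as an RNA copy of whichever strand lies 5'->3' downstream of the promoter.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def transcript(insert_top_strand):
    """RNA copy of the strand that reads 5'->3' downstream of the promoter."""
    return insert_top_strand.replace("T", "U")

fragment = "ATGGCTTCTAAGGAA"   # made-up cDNA fragment, sense orientation

sense_rna = transcript(fragment)
# Cloning the fragment in the opposite orientation places its reverse
# complement downstream of the promoter.
antisense_rna = transcript(reverse_complement(fragment))

print(sense_rna)       # AUGGCUUCUAAGGAA
print(antisense_rna)   # UUCCUUAGAAGCCAU
```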

(b) dsRNAi 

For dsRNAi vectors, a partial gene fragment (typically, 300 to 500 basepairs long) is 
used in the gene expression cassette, and is expressed in both the sense and antisense 
orientations, separated by a spacer region (typically, a plant intron, e.g. the OsSH1 intron 1, 
or a selectable marker, e.g. conferring kanamycin resistance). Vectors of this type are 
designed to form a double-stranded mRNA stem, resulting from the basepairing of the two 
complementary gene fragments in planta. 
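
A minimal sketch of the sense-spacer-antisense arrangement follows. The fragment and spacer are short made-up placeholders (real fragments are typically 300 to 500 basepairs and the spacer would be an intron or marker), and the function names are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def hairpin_insert(fragment, spacer):
    """Sense arm, spacer, then antisense arm, all on the top strand 5'->3'."""
    return fragment + spacer + reverse_complement(fragment)

def arms_can_pair(insert, arm_length):
    """True if the first and last arm_length bases are reverse complements."""
    return insert[:arm_length] == reverse_complement(insert[-arm_length:])

fragment = "ATGGCTTCTAAGGAACTTGG"   # made-up; stands in for a gene fragment
spacer = "GTAAGTTTCTGCAG"           # made-up placeholder for the spacer
insert = hairpin_insert(fragment, spacer)

print(insert)
print(arms_can_pair(insert, len(fragment)))   # True
```

Because the two arms are reverse complements of one another, the transcript can fold back on itself to form the double-stranded stem, with the spacer forming the loop.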

Biolistic or binary vectors designed for overexpression or knockout can vary in a 
number of different ways, including e.g. the selectable markers used in plant and bacteria, the 
transcriptional terminators used in the gene expression and plant selectable marker cassettes, 
and the methodologies used for cloning in gene or gene fragments of interest (typically, 
conventional restriction enzyme-mediated or Gateway™ recombinase-based cloning). An 
important variable is the nature of the gene expression cassette promoter, which can drive 
expression of the gene or gene fragment of interest in most tissues of the plants (constitutive, 
e.g. ZmUBIint MOD), in specific plant tissues (e.g. maize ADP-gpp for endosperm-specific 
expression), or in an inducible fashion (e.g. GAL4bsBz1 for estradiol-inducible expression in 
lines constitutively expressing the cognate transcriptional activator for this promoter). 

Example 9: Insertion of an "abiotic stress candidate gene" into an expression 
vector 

A validated rice cDNA clone in pCR2.1-TOPO or the pGEM-T easy vector is 
subcloned using conventional restriction enzyme-based cloning into a vector, downstream of 
the maize ubiquitin promoter and intron, and upstream of the Agrobacterium tumefaciens nos 
3' end transcriptional terminator. The resultant gene expression cassette (promoter, "abiotic 
stress candidate gene" and terminator) is further subcloned, using conventional restriction 
enzyme-based cloning, into the pNOV2117 binary vector (Negrotto et al., Plant Cell Reports, 
19:798-803, 2000; plasmid pNOV117 disclosed in this article corresponds to pNOV2117 
described herein), generating pNOVCAND. 

The pNOVCAND binary vector is designed for transformation and over-expression of 
the "abiotic stress candidate gene" in monocots. It consists of a binary backbone containing 
the sequences necessary for selection and growth in Escherichia coli DH-5α (Invitrogen) and 
Agrobacterium tumefaciens LBA4404 (pAL4404; pSB1), including the bacterial 
spectinomycin antibiotic resistance aadA gene from E. coli transposon Tn7, origins of 
replication for E. coli (ColE1) and A. tumefaciens (VS1), and the A. tumefaciens virG gene. 
In addition to the binary backbone, which is identical to that of pNOV2114 described herein 
previously (see Example 7 above), pNOV2117 contains the T-DNA portion flanked by the 
right and left border sequences, and including the Positech™ (Syngenta) plant selectable 
marker (WO 94/20627) and the "abiotic stress candidate gene" gene expression cassette. 
The Positech™ plant selectable marker confers resistance to mannose and in this instance 
consists of the maize ubiquitin promoter driving expression of the PMI (phosphomannose 
isomerase) gene, followed by the cauliflower mosaic virus transcriptional terminator. 

Plasmid pNOV2117 is introduced into Agrobacterium tumefaciens LBA4404 
(pAL4404; pSB1) by electroporation. Plasmid pAL4404 is a disarmed helper plasmid (Ooms 
et al., Plasmid, 7:15-29, 1982). Plasmid pSB1 is a plasmid with a wide host range that 
contains a region of homology to pNOV2117 and a 15.2 kb KpnI fragment from the 
virulence region of pTiBo542 (Ishida et al., Nat. Biotechnol., 14:745-750, 1996). 
Introduction of plasmid pNOV2117 into Agrobacterium strain LBA4404 results in a 
co-integration of pNOV2117 and pSB1. 

Alternatively, plasmid pCIB7613, which contains the hygromycin phosphotransferase 
(hpt) gene (Gritz and Davies, Gene, 25:179-188, 1983) as a selectable marker, may be 
employed for transformation. 

Plasmid pCIB7613 (see WO 98/06860, incorporated herein by reference) is selected for 
rice transformation. In pCIB7613, the transcription of the nucleic acid sequence coding for 
hygromycin-phosphotransferase (HYG gene) is driven by the corn ubiquitin promoter 
(ZmUbi) and enhanced by corn ubiquitin intron 1. The 3' polyadenylation signal is provided 
by the NOS 3' nontranslated region. 

Other useful plasmids include pNADII002 (GAL4-ER-VP16), which contains the yeast 
GAL4 DNA binding domain (Keegan et al., Science, 231:699, 1986), the mammalian 
estrogen receptor ligand binding domain (Greene et al., Science, 231:1150, 1986) and the 
transcriptional activation domain of the HSV VP16 protein (Triezenberg et al., Genes Dev., 
2:718-729, 1988), and pSGCDL1 (GAL4BS Bz1 Luciferase), which carries the 
firefly luciferase reporter gene under control of a minimal maize Bronze1 (Bz1) promoter 
with 10 upstream synthetic GAL4 binding sites. Both hpt and GAL4-ER-VP16 are 
constitutively expressed using the maize Ubiquitin promoter. All constructs use termination 
signals from the nopaline synthase gene. 

Example 10: Rice Transformation 

pNOVCAND is transformed into a rice cultivar (Kaybonnet) using Agrobacterium- 
mediated transformation, and mannose-resistant calli are selected and regenerated. 

Agrobacterium is grown on YPC solid plates for 2-3 days prior to experiment initiation. 
Agrobacterial colonies are suspended in liquid MS media to an OD of 0.2 at 600 nm. 
Acetosyringone is added to the agrobacterial suspension to a concentration of 200 µM and 
the Agrobacterium is induced for 30 min. 

Three-week-old calli which are induced from the scutellum of mature seeds in the N6 
medium (Chu et al., Sci. Sin., 18:659-668, 1975) are incubated in the agrobacterium solution 
in a 100 x 25 petri plate for 30 minutes with occasional shaking. The solution is then 
removed with a pipet and the callus transferred to a MS As medium which is overlaid with 
sterile filter paper. 

Co-cultivation is continued for 2 days in the dark at 22°C. 
Calli are then placed on MS-Timentin plates for 1 week. After that they are transferred to 
PAA + mannose selection media for 3 weeks. 

Growing calli (putative events) are picked and transferred to PAA + mannose media and 
cultivated for 2 weeks in light. 
Colonies are transferred to MS20SorbKinTim regeneration media in plates for 2 weeks in 
light. Small plantlets are transferred to MS20SorbKinTim regeneration media in GA7 
containers. When they reach the lid, they are transferred to soil in the greenhouse. 
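
For planning purposes, the durations stated above can be tallied as in the following Python sketch. The step labels are shorthand, one week is taken as 7 days, and the final stage in GA7 containers is omitted because no duration is given for it.

```python
# (step, duration in days) as stated in the text; one week is taken as 7 days.
STEPS = [
    ("co-cultivation in the dark at 22 C", 2),
    ("antibiotic plates after co-cultivation", 7),
    ("PAA + mannose selection", 21),
    ("PAA + mannose, putative events in light", 14),
    ("MS20SorbKinTim regeneration in plates", 14),
]

elapsed = 0
for name, days in STEPS:
    elapsed += days
    print(f"day {elapsed:3d}: end of {name}")

print(elapsed)   # 58
```

On these figures, roughly 58 days elapse between the start of co-cultivation and the transfer of plantlets to GA7 containers.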

Expression of the "abiotic stress candidate gene" in transgenic T0 plants is analyzed. 
Additional rice cultivars, such as, but not limited to, Nipponbare, Taipei 309 and Fuzisaka 2 
are also transformed and assayed for expression of the "abiotic stress candidate gene" 
product and enhanced protein expression. 

Example 11: Analysis of mutant and transgenic plant material 

11.1 Testing Arabidopsis seedlings using agar plates 

Arabidopsis seedlings can be assayed for abiotic stress phenotypes by measuring root 
growth rate under control and experimental conditions (Wu et al., Plant Cell, 8:617-627, 
1996). Four to five day-old axenic seedlings are produced by germinating surface-sterilized 
seeds on agar growth medium plates oriented vertically. The seedlings are transferred to agar 
growth medium containing inhibitory levels of NaCl or polyethylene glycol or mannitol or 
kept on the original plate for temperature stress (chilling stress = 4°C, freezing stress < 0°C, 
heat stress > 37°C) and control seedlings are transferred to a new plate containing normal 
growth medium. Upon transfer (or exposure to the temperature extreme), the plates are 
rotated 180 degrees so that subsequent root growth occurs in the exact opposite direction to 
previous growth due to the gravitropic root response. Growth subsequent to exposure to the 
abiotic stress is measured as the growth that occurs after bending of the root. Sensitivity or 
tolerance to abiotic stress is expressed as percent of experimental growth versus control 
growth following transfer. 
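
The percent-of-control measure described above can be computed as in the following Python sketch. The root elongation values are made up for illustration and the function name is hypothetical; no statistical testing is included.

```python
def relative_growth(stress_mm, control_mm):
    """Mean post-transfer root growth under stress as a percent of control."""
    mean_stress = sum(stress_mm) / len(stress_mm)
    mean_control = sum(control_mm) / len(control_mm)
    return 100.0 * mean_stress / mean_control

# Made-up root elongation values (mm) measured after the bend in the root.
control = [20.0, 22.0, 18.0]
wild_type_salt = [10.0, 11.0, 9.0]
mutant_salt = [4.0, 5.0, 6.0]

print(relative_growth(wild_type_salt, control))   # 50.0
print(relative_growth(mutant_salt, control))      # 25.0
```

In this invented example the mutant retains a smaller fraction of control growth than the wild type, which would be scored as increased sensitivity to the stress.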

11.2 Testing Arabidopsis or rice plants growing in soil 

Adult Arabidopsis plants can be assayed for abiotic stress phenotypes by exposing 
soil-grown plants to abiotic stresses such as NaCl-, osmotic-, drought-, or temperature-stress 
and scoring for survivability following exposure (Wu et al., Plant Cell, 8:617-627, 1996; 
Kasuga et al., Nat. Biotech., 17:287-291, 1999). Three-week-old plants are exposed to 
abiotic stress conditions by sub-irrigating with NaCl, mannitol or polyethylene glycol, or the 
pots are moved from normal growing temperature to < 0°C or > 37°C, or water is withheld. 
Resistant and sensitive phenotypes can be distinguished within 2 to 5 days of treatment 
depending on the stress (Kasuga et al., Nat. Biotech., 17:287-291, 1999). Similar methods 
have been applied to cereal plants such as rice (Babu et al., Crop Sci., 39:150-158, 1999; 
Saijo et al., Plant J., 23:319-327, 2000). It is also possible to assay abiotic stress phenotypes 
using plants growing hydroponically (Moons et al., Plant Physiol., 107:177-186, 1995; 
Kawasaki et al., Plant Cell, 13:889-905, 2001). Young plants are grown in liquid nutrient 
medium and the stress treatments are applied by mixing in the stressful compounds such as 
NaCl, mannitol or polyethylene glycol and assessing growth visually. 

Example 12: Chromosomal Markers to Identify the Location of a Nucleic Acid Sequence 

The sequences of the present invention can also be used for SSR mapping. SSR 
mapping in rice has been described by Miyao et al. (DNA Res., 3:233, 1996) and Yang et al. 
(Mol. Gen. Genet., 245:187, 1994), and in maize by Ahn et al. (Mol. Gen. Genet., 241:483, 
1993). SSR mapping can be achieved using various methods. In one instance, 
polymorphisms are identified when sequence specific probes flanking an SSR contained 
within a sequence are made and used in polymerase chain reaction (PCR) assays with 
template DNA from two or more individuals or, in plants, near isogenic lines. A change in 
the number of tandem repeats between the SSR-flanking sequence produces differently sized 
fragments (U.S. Patent No. 5,766,847). Alternatively, polymorphisms can be identified by 
using the PCR fragment produced from the SSR-flanking sequence specific primer reaction 
as a probe against Southern blots representing different individuals (Refseth et al., 
Electrophoresis, 18:1519, 1997). Rice SSRs can be used to map a molecular marker closely 
linked to a functional gene, as described by Akagi et al. (Genome, 39:205, 1996). 

The sequences of the present invention can be used to identify and develop a variety 
of microsatellite markers, including the SSRs described above, as genetic markers for 
comparative analysis and mapping of genomes. 

Some of the polynucleotides disclosed herein contain at least 3 consecutive di-, tri- or 
tetranucleotide repeat units in their coding region that can potentially be developed into SSR 
markers. Trinucleotide motifs that can be commonly found in the coding regions of said 
polynucleotides and easily identified by screening the polynucleotide sequences for said 
motifs are, for example: CGG, GCC, CGC, GGC, etc. Once such a repeat unit has been 
found, primers can be designed which are complementary to the region flanking the repeat 
unit and used in any of the methods described below. 
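
One way such a screen might be carried out is sketched below in Python using a regular expression. The function name, the toy sequence and the decision to skip single-base runs are assumptions made for illustration; the threshold of three consecutive repeat units follows the text.

```python
import re

def find_ssrs(seq, min_repeats=3):
    """Return (start, unit, repeat_count) for di-, tri- and tetranucleotide
    repeats with at least min_repeats consecutive copies of the unit."""
    # A unit of 2 to 4 bases (shortest tried first) followed by further copies.
    pattern = re.compile(r"([ACGT]{2,4}?)\1{%d,}" % (min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq.upper()):
        unit = match.group(1)
        if len(set(unit)) == 1:      # skip single-base runs such as AAAAAA
            continue
        hits.append((match.start(), unit, len(match.group(0)) // len(unit)))
    return hits

# Made-up sequence containing a CGG repeat and an AC repeat.
toy = "ATGT" + "CGG" * 4 + "TT" + "AC" * 4 + "TGA"
print(find_ssrs(toy))   # [(4, 'CGG', 4), (18, 'AC', 4)]
```

The reported start positions are 0-based and mark where flanking primers would need to sit on either side of the repeat.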

Sequences of the present invention can also be used in a variation of the SSR 
technique known as inter-SSR (ISSR), which uses microsatellite oligonucleotides as primers 
to amplify genomic segments different from the repeat region itself (Zietkiewicz et al., 
Genomics, 20:176, 1994). ISSR employs oligonucleotides based on a simple sequence repeat 
anchored or not at their 5'- or 3'-end by two to four arbitrarily chosen nucleotides, which 
triggers site-specific annealing and initiates PCR amplification of genomic segments which 
are flanked by inversely orientated and closely spaced repeat sequences. In one embodiment 
of the present invention, microsatellite markers, or substantially similar sequences, or allelic 
variants thereof, may be used to detect the appearance or disappearance of markers indicating 
genomic instability as described by Leroy et al. (Electron. J. Biotechnol., 3(2), at 
http://www.ejb.org (2000)), where alteration of a fingerprinting pattern indicated loss of a 
marker corresponding to a part of a gene involved in the regulation of cell proliferation. 

Microsatellite markers are useful for detecting genomic alterations such as the change 
observed by Leroy et al. (Electron. J. Biotechnol., 3(2), supra (2000)) which appeared to be 
the consequence of microsatellite instability at the primer binding site or modification of the 
region between the microsatellites, and illustrated somaclonal variation leading to genomic 
instability. Consequently, sequences of the present invention are useful for detecting 
genomic alterations involved in somaclonal variation, which is an important source of new 
phenotypes. 

In addition, because the genomes of closely related species are largely syntenic (that 
is, they display the same ordering of genes within the genome), these maps can be used to 
isolate novel alleles from wild relatives of crop species by positional cloning strategies. This 
shared synteny is very powerful for using genetic maps from one species to map genes in 
another. For example, a gene mapped in rice provides information for the gene location in 
maize and wheat. 



Example 13: Quantitative Trait Linked Breeding 

Various types of maps can be used with the sequences of the invention to identify 
Quantitative Trait Loci (QTLs) for a variety of uses, including marker-assisted breeding. 
Many important crop traits are quantitative traits and result from the combined interactions of 
several genes. These genes reside at different loci in the genome, often on different 
chromosomes, and generally exhibit multiple alleles at each locus. Developing markers, 
tools, and methods to identify and isolate the QTLs involved in a trait enables 
marker-assisted breeding to enhance desirable traits or suppress undesirable traits. The sequences 
disclosed herein can be used as markers for QTLs to assist marker-assisted breeding. The 
sequences of the invention can be used to identify QTLs and isolate alleles as described by Li 
et al. in a study of QTLs involved in resistance to a pathogen of rice (Li et al., Mol. Gen. 
Genet., 261:58, 1999). In addition to isolating QTL alleles in rice, other cereals, and other 
monocot and dicot crop species, the sequences of the invention can also be used to isolate 
alleles from the corresponding QTL(s) of wild relatives. Transgenic plants having various 
combinations of QTL alleles can then be created and the effects of the combinations 
measured. Once an ideal allele combination has been identified, crop improvement can be 
accomplished either through biotechnological means or by directed conventional breeding 
programs (Flowers et al., J. Exp. Bot., 51:99, 2000; Tanksley and McCouch, Science, 
277:1063, 1997). 

Example 14: Marker-Assisted Breeding 

Markers or genes associated with specific desirable or undesirable traits are known 
and used in marker-assisted breeding programs. It is particularly beneficial to be able to 
screen large numbers of markers and large numbers of candidate parental plants or progeny 
plants. The methods of the invention allow high volume, multiplex screening for numerous 
markers from numerous individuals simultaneously. 

A multiplex assay is designed providing SSRs specific to each of the markers of 
interest. The SSRs are linked to different classes of beads. All of the relevant markers may 
be expressed genes, so RNA or cDNA techniques are appropriate. RNA is extracted from 
root tissue of 1000 different individual plants and hybridized in parallel reactions with the 
different classes of beads. Each class of beads is analyzed for each sample using a 
microfluidics analyzer. For the classes of beads corresponding to qualitative traits, 
qualitative measures of presence or absence of the target gene are recorded. For the classes 
of beads corresponding to quantitative traits, quantitative measures of gene activity are 
recorded. Individuals showing activity of all of the qualitative genes and highest expression 
levels of the quantitative traits are selected for further breeding steps. In procedures wherein 
no individuals have desirable results for all the measured genes, individuals having the most 
desirable, and fewest undesirable, results are selected for further breeding steps. In either 
case, progeny are screened to further select for homozygotes with high quantitative levels of 
expression of the quantitative traits. 
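
The selection rule described above can be sketched as follows. This is only an illustrative outline under stated assumptions: the data layout (one record per plant with a presence/absence call for each qualitative marker and a numeric expression value for each quantitative marker), the function name, and the ranking by summed expression are all hypothetical simplifications rather than the assay's actual scoring.

```python
def select_for_breeding(plants, qualitative, quantitative, top_n=10):
    """Rank plants for further breeding from multiplex marker readings.

    plants: dict mapping plant id -> dict of marker name -> reading
            (bool for qualitative markers, float for quantitative markers).
    Plants positive for every qualitative marker are preferred; if no plant
    qualifies, the whole population is ranked by number of positive markers.
    """
    def n_present(rec):
        return sum(1 for m in qualitative if rec.get(m, False))

    def total_expression(rec):
        # Assumption: raw expression values are comparable and can be summed.
        return sum(rec.get(m, 0.0) for m in quantitative)

    complete = [p for p, rec in plants.items() if n_present(rec) == len(qualitative)]
    pool = complete if complete else list(plants)
    ranked = sorted(
        pool,
        key=lambda p: (n_present(plants[p]), total_expression(plants[p])),
        reverse=True,
    )
    return ranked[:top_n]


# Hypothetical readings for three plants, two qualitative markers, one quantitative marker.
readings = {
    "p1": {"q1": True, "q2": True, "e1": 2.0},
    "p2": {"q1": True, "q2": False, "e1": 9.0},
    "p3": {"q1": True, "q2": True, "e1": 5.0},
}
chosen = select_for_breeding(readings, ["q1", "q2"], ["e1"], top_n=2)
```

In practice the quantitative readings would likely need normalization per marker before being combined; the sum is used here only to keep the sketch short.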

Example 15: Method of modifying the gene frequency 

The invention further provides a method of modifying the frequency of a gene in a 
plant population, including the steps of: identifying an SSR within a coding region of a gene; 
screening a plurality of plants using the SSR as a marker to determine the presence or 
absence of the gene in an individual plant; selecting at least one individual plant for breeding 
based on the presence or absence of the gene; and breeding at least one plant thus selected to 
produce a population of plants having a modified frequency of the gene. The identification 
of the SSR within the coding region of a gene can be accomplished based on sequence 
similarity between the nucleic acid molecules of the invention and the region within the gene 
of interest flanking the SSR. 
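
The screening and selection steps can be illustrated with a small sketch. It assumes, hypothetically, that screening yields one presence/absence call per plant for the SSR marker; the function names and the example population are illustrative only, and the breeding step itself is not modeled.

```python
def gene_frequency(calls):
    """Fraction of screened plants in which the SSR marker (and gene) was detected.

    calls: dict mapping plant id -> True if the marker was detected, else False.
    """
    if not calls:
        raise ValueError("no plants were screened")
    return sum(1 for present in calls.values() if present) / len(calls)


def select_parents(calls, want_present=True):
    """Return the ids of plants whose marker call matches the breeding goal."""
    return [plant for plant, present in calls.items() if present == want_present]


# Hypothetical screening results for four plants.
screened = {"p1": True, "p2": False, "p3": True, "p4": False}

before = gene_frequency(screened)          # frequency in the screened population
parents = select_parents(screened)         # plants carrying the gene
parent_pool = {p: screened[p] for p in parents}
after = gene_frequency(parent_pool)        # frequency among the selected parents
```

Passing want_present=False instead selects plants lacking the gene, which corresponds to reducing its frequency in the next generation.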

The results disclosed herein demonstrate that several polynucleotides, some of which 
were known to function as transcription factors, enzymes, and structural proteins, also are 
involved in the response of a plant cell to stress. The identification of stress-regulated genes as 
disclosed herein provides a means to identify stress-regulated regulatory elements present in rice 
nucleotide sequences, including consensus regulatory elements. Furthermore, the identification 
of the rice stress-regulated genes provides a means to identify the corresponding homologs and 
orthologs in other plants, including commercially valuable food crops such as wheat, maize, soy, 
and barley, and ornamental plants. 

Table 1 

A = ORFs from drought tolerance QTL in rice chromosome 3 
B = Cold induced genes from the rice genomic region syntenic with maize cold tolerance QTL 
C = Genes in the drought QTL on chromosome 8 showing >= 2x change during drought stress 
D = Rice proteins that have a universal stress protein domain 
E = ORFs from rice genomic region that is syntenic to a maize cold tolerance QTL 
F = ORFs from drought tolerance QTL in rice chromosome 8 
G = Abiotic stress regulated universal stress protein domain containing genes 
H = Stress-regulated rice genes that are homologs of Arabidopsis stress regulated genes 
I = Genes from drought QTL on chromosome 3 showing >= 2x change up or down during drought stress 

SEQ ID A B C D E F G H I 



3177 X 

3178 X 

3179 X ----- - 

3180 X ------ - 

3181 X ------ - 

3182 X ------ - 

3183 X ------ - 

3184 X ------ - 

3185 X ------ - 

3186 X ------ - 

4379 X 
1328 X 

3187 ----- X - - 
2554 X - - 

3188 ----- x - - 
3189 X 

3190 x 

5827 x 

3191 X 

3192 x - - 

3193 ----- x - - 

3194 ----- X - - 
3195 X 

4863 x 

3196 ----- X - - 
355 X ------ _ 

3197 ----- X - - 

3198 ----- X - - 
795 X 

1583 X 

71 X - - - 

3199 X - - - 

3200 X 
3207 


- 


- 


- 


X 


- 


_ 


_ 


3208 


X 


- 


- 


_ 


- 


- 


_ 


3209 


X 


- 
_ 


_ 


_ 


3210 


X 
_ 
3211 
- 




- 
3212 
- 
X 




- 
3215 
- 
_ 
3216 
X 
_ 
6692 


- 






_ 
3217 
X 
- 


- 
X 
3222 
_ 
3223 
_ _ 
X 
_ 




7356 


- 


- 
- 
_ 
_ 


3233 


- 




X 
_ 




X 




_ 


3972 






_ 


_ 


X 




_ 


3973 


_ 


_ _ 


_ 


_ 


X 




_ 


3974 


X 


_ _ 




_ 




_ 


_ 


3975 


X 


_ _ 




_ 


_ 


_ 


_ 


3976 


X 


_ _ 


_ 


_ 






_ 


3977 


X 


_ _ 




_ 


_ 




_ 


3978 


X 


„ — 


_ 


_ 


• 




_ 


1829 


X 


_ 


_ 




_ 






3979 










X 




- 


3980 


_ 


_ _ 


_ 


_ 


X 




- 


3981 


_ 


_ 




X 






- 


3982 


- 


- 


- 


X 


- 




- 


3983 




_ 


- 


X 


- 




- 


3984 






_ 


X 








3985 


_ 


_ 


_ 


X 








3986 






_ 


_ 


X 






3987 


X 


• 


_ 




_ 






3988 


X 


_ _ 


m 


_ 


_ 


_ 




3989 


X 














3990 


X 














3991 


X 














3992 


x 














3993 










X 






3994 










X 






2089 


X 














3995 


X 














3996 


X 














3997 


X 
3998 


X 


_ 


- 


- 


- 


- 


- 


3999 


X 


_ 


- 


- 


_ 


- 


_ 


2544 


_ 


- 


X 


_ 




- 


- 


4000 


X 


_ 


- 


- 


- 


- 




546 


X 


- 


_ 


- 


_ 


- 


_ 


4001 


X 


_ 


_ 


_ 


_ 


- 


_ 


4002 


X 


_ 


_ 






_ 


_ 


4003 


X 


_ 


m 


_ 






_ 


4004 


X 




_ 




_ 


_ 


_ 


4005 




_ 




_ 


X 




_ 


8137 


_ 


X 


_ 


_ 






_ 


4007 






_ 


_ 


X 




_ 


4008 


- 


_ 


_ 


X 




_ 




4009 


_ _ 


_ 






X 


_ 


_ 


4010 


_ _ 


_ 


_ 


_ 


X 




_ 


5683 


_ _ 


_ 


_ 






X 


_ 


4011 


_ _ 


_ 


_ 


_ 


X 


_ 


_ 


4012 


_ _ 


_ 




X 


_ 




_ 


4013 


_ 


_ 


- 


- 


X 




- 


4014 


_ 


- 


- 




X 




- 


4015 


_ 


- 


- 


_ 


X 




- 


4016 


_ 


_ 




- 


X 




_ 


4017 




_ 




_ 


X 




_ 


4018 








_ 


X 






4508 


_ 


_ 


_ 


_ 


_ 


X 


_ 


4019 


_ 






_ 


X 




_ 


856 


X 




_ 








- 


4020 


X 


_ 


_ 


_ 


_ 




_ 


4021 




_ 


_ 


_ 


X 






4022 




_ 




X 






_ 


4023 


_ _ 






X 


_ 


_ 




4024 


X 


_ 






m 


_ 


_ 


4025 


X 


_ 


_ 


_ 


_ 






4026 




_ 


_ 


X 


_ 


_ 


.. 


4027 




_ 




X 


_ 


_ 


_ 


4028 


_ _ 




_ 


X 




_ 


_ 


4029 


_ _ 


_ 


_ 


X 


_ 


_ 


_ 


4030 


_ _ 


_ 


_ 




X 




_ 


4031 


_ — 


_ 


_ 


_ 


X 


_ 


_ 


4032 


_ _ 




_ 


X 




_ 




4033 


_ _ 


_ 


_ 


X 




_ 


_ 


6968 


_ _ 


_ 


_ 




_ 


X 




86 


_ 


_ 


• 


_ 


X 


_ 


_ 


4034 


- 






_ 


X 




_ 


4035 


_ 


_ 


_ 


_ 


X 






2419 


- 


_ 


- 


X 


_ 


- 


_ 


4036 


- 


_ 




X 


- 


- 


_ 


4037 


_ 


_ 


- 


- 


X 




- 


4038 


_ 


- 


- 


- 


X 




_ 


961 


X 


- 




- 


_ 




- 


4039 


X 


- 






_ 




_ 


4040 


X 


_ 






_ 




_ 


4041 










X 






1617 






X 










4042 


X 














4043 


. X_ - 














1517 


X 














4044 


X 














4045 










X 






4046 


X 














4047 


X 














4048 


X 
4049 


X 


_ 


_ 


_ 




_ 


1815 


X 


_ 


_ 


_ 






2008 


X 


_ 


- 






_ 


409 


X 


_ 


- - 


_ 




_ 


4050 


X 


_ 




_ 




_ 


4051 


_ 




_ 


x 


_ 


_ 


117 


X 


_ 


_ _ 




_ 


_ 


4052 


_ 


_ 


_ 


X 


_ 


_ 


224 


X 


_ _ 


« _ 








4053 


X 


_ _ 


_ _ 




_ 


_ 


4054 


X 


_ _ 


_ _ 


_ . 




_ 


4055 


X 


_ _ 


_ _ 


_ 




_ 


4056 




- 


— — 


X 




_ 


4057 




_ 


_ _ 


X 




_ 


4783 


_ 


_ 


_ _ 




X 


_ 


4058 


X 


_ 


_ „ 








7242 




_ 


_ _ 




X 




4059 


X 


- 


_ 


_ 




_ 


4060 


X 




_ 


_ 


_ 


_ 


406 


X 


_ _ 


_ _ 


_ 


_ 




6320 




_ 


_ 


_ 


X 


_ 


4062 


_ 




_ _ 


X 


_ 


_ 


8183 


_ 


X 


_ 




_ 




4063 


_ 




_ _ 


X 




_ 


4064 


_ 


_ 




X 


» 


_ 


4065 


_ 


_ 




X 


_ 


_ 


4066 


_ 


_ _ 


_ _ 


X 


_ 


_ 


4067 


- 


_ _ 


_ _ 


X 


_ 


_ 


4068 




_ M 


_ 


X 






4069 




_ _ 


_ _ 


X 


_ 


_ 


1028 


_ 




X 






_ 


4070 




_ _ 




X 






4071 


_ 


_ _ 


_ _ 


X 


_ — 


_ 


4072 


_ 


_ _ 


• — 


X 




_ 


4073 


_ 


_ 


_ 


X 






348 


_ 


_ _ 


X 








4074 


X 


_ _ 










4075 


X 


_ _ 


_ _ 


_ 






6142 




_ 






X 


_ 


4076 


_ 


_ 




X 




_ 


2911 




_ 


_ 


X 


• 


_ 


4077 


_ 


_ _ 


_ _ 


X 


_ 


_ 


4078 


X 


_ _ 


_ _ 




— 


_ 


4079 


X 


_ _ 




_ 


_ 


_ 


4080 


X 


_ _ 




_ 






4081 


X 


_ _ 


_ _ 


_ 






4082 


X 




_ _ 


_ 


_ 


_ 


2903 


X 


_ _ 


_ 






_ 


4083 


X 


_ _ 


_ _ 


- 


_ 




4084 


X 


_ _ 




_ 






4085 




_ _ 


_ 


X 






4086 


X 


_ _ 


m _ 






_ 


4087 


X 












4088 








X 






4089 


X 












4090 


x_ 












4091 






-~ ~x 








6270 










X 




5093 










X 




492 


X 












4092 


X 












3023 


X 
4093 


X 


- 


_ 




178 


X 






- 


2701 


X 


- 


- 


- 


5799 


_ 




_ 


X 


4094 






X 




4095 






X 




4096 


_ _ 




X 




4097 


X 




_ 




4098 




X 


_ 




4099 


_ 




X 




4100 


_ 




X 




4101 


_ 


X 


_ 




4102 


_ 


X 


_ 




4103 


_ 


X 






4104 


_ 


X 


_ 




4105 


- - 


X 


_ 




4106 




X 


_ 




4107 


_ _ 


X 


_ 




4108 


_ _ 


X 


_ 




5804 


_ _ 




_ 


X 


4109 


_ _ 


- — 


X 




4110 


- 


- - 


X 


— 


4111 


X 


_ _ _ 




_ 


4112 


X 


_ _ 






4113 


X 


- - 


_ 


_ 


6635 




_ _ _ 


_ 


X 


4114 


_ _ 


X 


_ 


_ 


4115 


- - 


X 




_ 


4116 


_ _ 




X 


_ 


5548 


_ 


_ _ 




X 


4117 


X 


_ _ _ 


_ 




4118 


X 


_ _ _ 


_ 


_ 


4119 


X 


- _ 






4120 


X 




- 


_ 


4121 


X 


_ _ _ 






4122 


X 




_ 


_ 


4123 


X 








2308 


X 








4124 




X 






4125 




X 






4126 




X 






4127 




X 






4128 




X 






4129 




X 






4130 




X 






4131 




X 
Table 2. Expressed sequence tags from banana, wheat and maize having homology to the 
abiotic stress responsive polynucleotides of the present invention. 

SeqID   BANANA   WHEAT   MAIZE 


1 




10978 




2 


9424 


10548 


12773 


3 




10164 


12546 


4 




11828 


13040 


6 


9321 


11158 


12911 


7 


— 


10172 




8 


9704 


11791 


12284 


9 




11268 


12668 


10 


9455 


9956 


13298 


11 




9920 




13 




11383 


13282 


14 




11224 


12268 


15 




11281 




16 


9511 


11430 


13049 


17 


9654 


10905 


12734 


18 


9779 


10199 


12291 


19 




11020 


13191 


21 


9396 


10584 


12226 


22 




11885 




23 




11932 




24 


9365 


10132 


13043 


26 


9830 


11796 


12992 


27 


9237 


11511 


12372 


28 




10674 




29 




11813 


13301 


30 






12187 


31 




11583 


13157 


32 


9643 


11085 


12066 


33 




11605 


12696 


34 


9814 


11093 


12829 


35 


9684 


11774 


12345 


36 




9972 


12926 


37 


9566 


11383 


12927 


38 




10334 




39 


9436 


10137 


12013 


41 




11552 


12894 


42 


9770 


9938 


12698 


43 


9476 


10860 


12729 


45 






12514 


46 




10983 




47 


9599 


10373 


13032 


48 




10216 




49 


9303 


11394 




50 


9289 


9892 




51 


9475 


10811 


12378 


52 




11737 




53 


9342 


10276 


12421 


54 - 




11386 


122^3^ 


55 




11457 


12503 


56 


9313 


10417 


12338 


57 




11064 


12324 


58 






12472 


59 


9225 


10611 


12365 


60 


9391 


11870 


13366 
61 




62 


9586 


63 


9726 


64 




65 


_ 


66 


_ 


67 


9799 


68 




69 




70 




71 




72 


9573 


73 




74 


9600 


75 




76 


9360 


77 


9687 


78 




79 


9720 


80 


9456 


81 




82 




83 


9235 


84 


9504 


85 




86 


9546 


88 


9250 


89 




90 


_ 


93 


9312 


94 


9402 


96 




98 


9841 


99 


9429 


100 


9507 


101 




102 


9390 


103 


9682 


104 




105 




106 


9851 


107 


9446 


108 




109 




110 




111 


9557 


112 


9456 


114 




115 


9498 


116 


9772 


117 


9227 


119 




120 




121 


9382 


122 


9421 


123 


9701 


124 


9589 


125 


9870 


126 




127 




128 


9698 


129 


9799 




11695 12510 

11317 13336 

11595 12072 

11172 13368 

11857 12910 

10265 12447 

11140 13349 

10784 12372 

11146 12618 
10315 
10462 

11817 12950 

9890 13060 

11127 12702 
12033 

11324 12916 

11795 13222 
12861 

11255 12103 

10373 13032 

10530 12290 

10026 13279 

10431 12832 
10917 

11840 12385 

11314 12697 

11675 13371 

11049 12557 

10193 13225 

11339 12858 

11707 13297 
11506 

11859 12364 

11190 12186 
9900 

10415 12308 
11852 

11047 12151 

11522 12998 
10649 

10789 12347 

10955 12129 
10319 

9946 12614 

9883 13065 
10563 

11149 12663 
9992 

10943 12329 

11378 

11411 

10235 

10172 

10202 13285 

10262 12183 

11671 12997 

11490 12069 

10942 13257 

10265 12447 

11046 12056 

11832 12527 

11356 12660 



131 


9688 


11501 


13250 


132 




11145 


12663 


133 


m 


10197 


12170 


134 


_ 


11634 


12573 


135 


9465 




12155 


136 




11803 




137 




11161 




138 


9755 


11647 


12433 


139 




10322 




140 


9423 


10417 


12338 


141 




11657 


13137 


142 




11196 


12307 


143 




10021 




144 




11319 


13263 


145 


9630 


9886 


12302 


146 




11060 




147 




11161 




148 


9244 


10978 




149 


9555 


10368 


12781 


150 


9658 


11914 


12695 


151 




10253 


12826 


152 






12499 


153 


9756 


_ 


12744 


154 




10498 




155 


_ 


11844 


13019 


157 


_ 


11383 


11991 


158 


9626 


11508 


12834 


159 




9990 




160 


_ 


9997 


13071 


161 


_ 


10150 


12304 


162 




10697 




163 




10321 


13367 


165 




11097 


12094 


166 




11706 


13048 


167 




10157 


12981 


168 




10515 




169 




10628 


12573 


170 


9456 


11145 


12663 


171 




11016 


12355 


172 




9986 


13302 


173 




11544 


13083 


174 


_ 


10990 


12134 


175 


9742 


11076 


12914 


176 


9441 


11199 


12260 


177 




10232 


13090 


178 


9324 


10319 


12967 


179 




10040 




180 


9548 


11141 


12596 


181 


9799 


11356 


12660 


182 




10640 


12886 


183 


_ 


10868 




184 


9494 


10249 


13161 


186 




11776 


12372 


187 




10949 


12106 


188 


9377 


10429 


12467 


189 


9477 


10527 


13110 


190 






12107 


192 




10980 




193 




10537 




194 




10091 


13210 


195 


9543 


10769 


12440 


196 






12056 



197 


- 


11667 


13171 


198 


- 


- 


12170 


199 


- 


10173 


13265 


201 


9405 


11964 


12023 


202 


9806 


10744 


12302 


204 


_ 


10465 


_ 


206 




11728 


_ 


208 




11525 


12076 


209 


9489 


11145 


12663 


210 


9362 


10343 


12440 


211 




10458 


_ 


212 


_ 


11172 


_ 


213 


9260 


11692 


12575 


214 




11778 


12361 


215 


_ 


11136 


12766 


216 


- 


10682 


- 


217 


- 


11374 


13347 


218 


- 


10634 


_ 


220 


- 


10207 


- 


221 


- 


10640 


13118 


222 


9251 


_ 


_ 


223 


- 


10040 


- 


224 


9612 


11070 


13172 


226 


9365 


11738 


13237 


228 


9427 


10640 


12361 


229 


9430 


11663 


12945 


230 


_ 


10674 


12378 


231 


9615 


11590 


12867 


232 


- 


11941 


- 


233 


9365 


11739 


13297 


234 


9529 


_ 


_ 


235 


9720 


11633 


12103 


236 




10063 




237 


9736 


11128 


12030 


238 


_ 


11610 


12668 


239 


9388 


9919 


12269 


240 


9798 


11395 


13349 


241 


9434 


11724 


13050 


242 


- 


11226 


12738 


243 


- 


10640 


13118 


244 


- 


10916 


_ 


245 


- 


11779 


- 


246 


- 


9949 


- 


247 


- 


10532 


- 


248 


- 


10232 


13090 


249 


- 


9990 


12706 


250 


- 


11133 


13255 


253 


9528 


10227 


12750 


254 


9402 


10982 


12212 


255 


9870 


10950 


13088 


257 


9321 


11777 


12361 


258 


9446 


10759 


12380 


259 


9246 


11730 


13087 


260 




10728 




261 




10389 


12932 


262 


9448 


11634 


13220 


264 


9601 


10800 


11996 


265 


9521 


11209 




266 




10983 




267 




11594 




268 


9292 


10417 


12337 


269 


9793 


10540 


13049 
270 


- 


271 


9247 


272 


9384 


273 


- 


274 


9720 


275 


9371 


276 


- 


277 


_ 


278 


9704 


279 


9404 


280 


_ 


281 


- 


282 


- 


283 


_ 


284 


_ 


285 


- 


287 


9838 


289 


_ 


290 


_ 


291 


9751 


295 


_ 


296 


_ 


297 


9853 


298 




301 


9366 


304 




305 


9324 


306 


9308 


307 


9832 


308 


9812 


309 


9478 


310 


9589 


311 


_ 


312 


9692 


313 


_ 


314 




315 


9499 


316 


9880 


317 


9671 


318 


9532 


319 


_ 


320 


9446 


321 


- 


322 


9708 


324 


9416 


325 


- 


326 


9829 


327 


_ 


328 


9589 


329 


_ 


330 


- 


331 


_ 


332 


9666 


333 




334 




335 




336 




337 




338 




339 


9777 


340 


9313 


341 


9768 






10023 


12778 


10297 


11973 


11733 


12942 


11215 


- 


11123 


12133 


10230 


_ 


11540 


12682 


11290 


13251 


11057 


12822 


11422 


13053 


10006 


_ 


11135 


12881 


10255 


13213 


_ 


13056 


9949 


12676 


10667 




11479 


_ 


10440 




_ 


13293 


11365 


12340 


10735 




11256 


13176 


10489 


12847 


11288 


12558 


11125 


12133 


10440 


_ 


11063 


13178 


11526 


13184 


11529 


12687 


10653 


12376 


11185 


12016 


11260 


_ 


10795 


12511 


10899 


12286 


11050 


12056 


11172 


13368 


10209 


12148 


10084 


11975 


11094 


12826 


11867 


12032 


10000 


13177 


10955 


12129 


9949 


_ 


11134 


12840 


10612 


12856 


10799 


_ 


10036 


12143 


11540 


12368 


11112 


12256 


11629 




11382 


12751 


11493 




11103 


12055 


10362 




10696 




11040 


13092 


11819 


11988 


11346 


13144 


11386 




11726 


13101 


10417 


12338 
342 


9880 


343 


9869 


344 


- 


346 


- 


348 


- 


349 


- 


350 


- 


351 


9735 


352 


9880 


353 




355 


9810 


356 


9447 


357 




358 


_ 


359 


9392 


361 


9741 


362 


9436 


364 




365 


9262 


366 


_ 


367 


_ 


368 


9789 


369 




370 


9698 


371 


9331 


372 


9869 


374 




375 


9437 


376 


- 


377 


9880 


378 


9506 


379 


- 


380 


- 


381 


- 


382 


- 


383 


9269 


384 


9808 


385 


9412 


386 


9237 


387 


_ 


388 


9446 


389 


9305 


390 


_ 


391 


_ 


392 


9376 


394 


9302 


395 


- 


396 


- 


397 


9742 


400 


9635 


401 


9742 


402 


9833 


403 




404 


9678 


405 




406 




407 


9587 


408 


9606 


409 




410 




412 


9851 


413 


9704 




11733 12942 

10652 12393 
10005 
10209 

10332 12043 

10945 13051 
9925 

10119 13260 

10103 12942 

10640 12361 

11519 12654 

10993 12952 
10017 

12014 

11707 13043 

11077 13186 
10137 

10783 

9953 13129 
11850 

11058 12567 

10544 13234 
11386 

11832 12527 
11389 

11776 12393 
12868 

10146 

9926 12460 

11733 12942 

11632 12784 
10689 

11889 12533 
11666 

12123 

11846 

10214 12954 

11038 12147 

10594 13336 
10001 

10759 12380 

11397 12859 
9945 
10728 

11250 13318 

10777 12909 

10230 13213 
11488 

11072 12491 

11091 12713 

11078 13352 
11478 13167 
10850 

10062 12592 

10054 13090 

10946 12599 

10733 12573 

10388 13122 

10259 12811 

11595 12669 

10985 12183 

11057 12984 



414 


9365 


10143 


12458 


415 




10285 


13201 


417 


9575 


11234 


12402 


416 


_ 


10451 




419 


9256 


11199 


12260 


420 


9765 


11440 


12603 


421 




11214 




422 




10224 


12293 


424 


9553 


10458 




425 




10331 


12360 


426 




10199 


13118 


428 




11876 




429 




11642 




430 


_ 


10503 


13070 


431 




9948 




432 




11465 


12554 


433 


_ 


10040 




435 


9629 


10838 


12463 


436 






13086 


437 


9376 


11856 


12784 


438 


9324 


11205 


12967 


439 


9779 


11248 


13259 


440 


9782 


9974 


_ 


441 


9366 


11125 


12133 


442 


_ 


11576 


12792 


443 


9653 


11318 


12499 


444 


_ 


10329 


12866 


445 


_ 


11254 


12847 


446 


_ 


11811 


13051 


447 


9496 


11163 


13022 


448 


9518 




12583 


449 


9720 


11632 


12133 


450 




10422 




451 




9958 


12119 


452 


9401 


9950 


12022 


453 




10564 




454 


9446 


11202 


12428 


455 


9391 


10127 


12497 


456 




11593 


12012 


457 


9668 


10390 


12872 


459 


9380 


11797 


12041 


460 


9446 


11202 


12428 


461 




10690 


12351 


462 


_ 


10138 


13000 


463 


9589 


11490 




464 


9819 


11409 


12434 


465 


9871 


10942 


13187 


466 


_ 


11948 


12366 


467 


- 


9905 




468 


9682 


11044 


12044 


469 


- 


10028 




470 


- 


10453 


_ 


471 


9819 


11425 


12434 


472 


9401 


9913 


12424 


473 




10656 




474 


9802 


11257 


12279 


475 


9363 


10743 


12324 


476 






12285 


478 




11618 


12652 


480 




10745 




481 


9330 


10900 


12482 


482 






12008 



483 


_ 


11696 


13051 


484 


9564 


10500 


12583 


485 


9228 


10505 


12485 


486 


_ 


10685 




487 


9586 


11353 


12903 


488 


9355 


11092 


12656 


489 


9852 


10934 




490 


9248 


10913 




491 


9301 


11373 


12922 


493 




11665 




494 


9548 


11474 


12596 


495 




10764 


12324 


498 


9363 


10512 


12094 


500 




10804 




501 


_ 


11385 


_ 


502 


9684 


11765 


12318 


503 


9392 




12577 


504 




10222 




505 


9687 


11172 


12761 


506 




10184 




507 


9811 


10667 


13031 


508 


9739 


10654 


12889 


509 


_ 


11786 


_ 


511 


_ 


10286 


12084 


512 


9443 


11278 


12815 


513 


9720 


11523 


12316 


514 


_ 


10305 




515 


9450 


11304 


12429 


516 


_ 


10750 


13166 


517 


9244 


9998 


12140 


518 


9781 


11269 


12153 


519 




11650 


12681 


520 




11810 


13051 


523 


9704 


10079 


12530 


524 




10982 


13074 


525 




11145 


12912 


526 


— 


10165 


12077 


527 


_ 


10562 


12168 


529 


9705 


11104 


12341 


530 




11425 


13215 


531 


_ 


11556 


13255 


533 


9376 


11250 


12784 


534 


- 


9984 




535 


_ 


11024 


_ 


536 




11046 


12056 


537 


_ 


10937 


13219 


538 




10745 




539 


- 


11275 


13033 


540 


9436 


11910 


12013 


541 


9880 


10103 


12942 


542 


_ 


9915 


12419 


543 


_ 


9890 


13060 


544 


9540 


9947 




545 




11826 


12859 


546 




9976 


13155 


547 


9308 


11461 




548 




11359 


12052 


549 


9731 


10923 




550 




10472 


12483 


551 




11835 


12705 


552 




9994 


12355 


553 


9313 


10417 


12338 
556 




557 




558 


9719 


559 




560 


9537 


561 




562 




563 


_ 


564 


9880 


565 




566 


_ 


567 


9400 


569 


9722 


570 


9222 


571 




572 


9308 


573 




574 


_ 


575 




576 


9446 


577 




578 


_ 


579 


9285 


580 


9868 


581 


9851 


582 




583 


9639 


584 


9841 


585 


9426 


586 




587 


_ 


588 




590 


_ 


591 


9436 


592 


9423 


593 


9532 


594 




595 


_ 


596 


_ 


597 




598 


9267 


599 


9600 


600 




601 


_ 


603 


_ 


604 


9871 


605 


9277 


606 




607 


9870 


609 




612 


_ 


613 




614 


9819 


615 




616 


9520 


617 


9687 


619 


9368 


621 




622 




623 


9653 


624 




625 






11464 
11160 

11902 12690 

11607 12400 

11514 12233 

11405 13001 
12378 

11277 12056 

11685 12837 

10526 12156 

11539 12449 

11298 13185 

10626 12374 

10693 12332 
12885 

9962 

10224 12293 

10098 

10672 

11201 12428 

10850 12388 

10023 12778 

11664 12273 

11137 12610 

11722 13031 

11763 12373 

10480 12209 

11088 12298 

11437 12275 

11374 13347 
11825 
9949 

11383 12886 

10589 12013 

11511 12372 

10752 12031 
11014 

11524 13230 

10032 

9977 

11450 13093 

11127 12702 

10983 12427 

10784 12372 

11045 12737 

10942 13257 

10403 12675 
10858 

10942 13257 

11077 13186 

10230 13189 

11061 12306 

10457 12434 
11155 

10635 12926 

11057 12984 
11716 13235 

11573 12486 
13158 

11318 13272 

10180 13363 
10983 



626 


_ 


11823 


12304 


627 


- 


10173 


12505 


628 


- 


11940 


12528 


629 


_ 


_ 


13164 


630 


9423 


11367 


12338 


631 


_ 


10245 


13100 


632 


9608 


11057 


12984 


633 




10983 


_ 


634 




10409 


13106 


635 


_ 




12789 


636 


9831 


11321 


13175 


638 




10853 


13277 


639 


9446 


11200 


12428 


640 




10576 


12568 


641 




11386 


12293 


642 


9339 


11708 


13081 


643 




11385 


12293 


644 






12886 


645 


_ 


10427 




647 


_ 


11628 


_ 


648 


9384 


9899 


12756 


649 




9898 


12773 


650 


9381 


10156 


12880 


651 


9261 


10401 


13160 


652 


9396 


10583 


12226 


653 


9656 


10385 


12879 


654 




10515 




655 


_ 




12249 


656 


9404 


10741 


13053 


657 


9481 


10073 


12854 


659 


9340 


11233 


12819 


661 




11001 




662 




11960 


13010 


663 


- 


10682 


13311 


664 


_ 


10166 


13128 


665 


9456 


11146 


12663 


668 


_ 


10729 


12817 


669 


9684 


10610 


12318 


670 


_ 


11851 


12830 


671 


_ 


10858 


12976 


673 


_ 


11154 


12118 


674 


9382 


11751 


_ 


676 


- 


11544 


12567 


677 


_ 


11940 


12528 


678 


_ 


10825 


12768 


680 


- 


10751 


12487 


681 


9477 


11560 


12161 


682 


9720 


11123 


12578 


683 


9523 


11206 


13081 


684 




10170 


12850 


685 


_ 


11053 


12661 


686 


_ 


11245 




687 


9831 


11803 




688 




10820 


13240 


689 


9358 


10151 


12775 


690 


9566 


9937 


12042 


691 




11481 




692 


9595 


11079 


12623 


693 


9878 


11216 


13078 


694 


9425 


10040 




695 


9681 


11579 


12718 


696 


9570 


11371 


12442 



697 


9642 


10477 


12294 


698 




9949 


_ 


700 


- 


10851 


13322 


701 


- 


11861 


13339 


702 


9784 


9906 


12350 


703 


9824 


11253 


12821 


704 


9404 


11596 


12960 


705 


9563 


10410 


13208 


706 




10620 


13092 


707 


9692 


11306 


13212 


708 




11380 


12419 


709 


9394 


11188 


12672 


710 




10407 




711 




9933 


12964 


712 


_ 


11741 


13252 


713 


_ 


10314 


12663 


714 


m 


11128 


13084 


715 


_ 


10964 




716 


9363 


11089 


12094 


717 




11162 




718 




10974 


12632 


719 


9574 


10622 


13069 


720 


9527 


11487 


12595 


721 


9475 


11609 




722 


9408 


10534 


12586 


723 


_ 


9936 


12109 


724 


9263 


11511 


12372 


725 


_ 


11523 


13263 


726 


_ 


11499 


13246 


727 


_ 


10801 


12868 


728 


_ 


10107 


13034 


729 


_ 


10009 


_ 


730 




10313 


_ 


731 


_ 


10858 




732 


_ 


10640 


12363 


733 




11956 


12574 


734 


_ 




13044 


735 


_ 


10678 


12751 


736 


_ 


9949 




737 


_ 


11746 


12604 


739 


9828 


11096 


12199 


740 




10765 


12380 


741 


9820 


10440 




742 


9821 


11090 


12026 


743 


_ 


10682 




745 


_ 


10739 




746 


9692 


10390 


12425 


747 




10078 




749 


9363 


11526 


_ 


750 


- 


10561 


_ 


751 


_ 


10139 


12094 


752 


9575 


11445 


12044 


753 




10369 




754 


9800 


10624 


12326 


755 


9779 


11249 


12166 


756




10405 


13240 


757 


9778 


11893 


12040 


758 


9502 


10751 


12775 


759 


9518 


10599 


13289 


760 


9243 


11371 


12442 


761 


9719 


11161 




763 




10423 
764


_ 


765 


_ 


766 


9855 


767 


9497 


768 




769 


_ 


770 


9685 


771 




772 


_ 


773 


9558 


774 




775 




776 


9461 


777 


9734 


778 




779 




781 




782 




783 


9880 


784 




785 


_ 


786 


_ 


787 




788 


9742 


789 




790 




792 


9474 


793 




795 


9783 


796 


9217 


797 




799 


_ 


800 


9799 


802 




803 


9528 


804 


9761 


805 




806 


9363 


807 


9535 


808 




809 


9621 


810 




811 


_ 


812 


_ 


813 




815 


_ 


816 


9291 


817 


9308 


818 


9522 


820 




821 


9384 


822 




823 


9319 


824 


9376 


825 




826 




827 


9768 


828 


9494 


829 


9365 


830 




831 




832 


9575 




10223 
10486 

11460 12872 

11065 12261 

11697 12089 
12591 

11280 12444 
10204 

11820 13282 

11012 12951 
12707 

10061 13147 

10286 12084 

11018 12281 
10515 
10631 

10266 12380 

10034 13325 

10103 12942 

11608 12400 

10015 12328 
11390 

10543 12888 

11075 12914 

9935 12109 

10690 12183 

10273 13359 
10725 

11230 13255 

11860 12466 
12014 

9943 

11392 13303 

11902 12681 

11204 12261 

10907 12913 
11143 

11475 12819 

11472 12719 

10510 12771 

10218 13244 

11899 12265 
10116 

10706 12446 

11900 13138 
11067 12035 
10671 

10330 

11097 12094 

11922 12756 

11459 12665 

10569 12562 

11126 12578 

11173 12822 
9926 

11054 12536 

10143 13043 
10920 
11543 

10595 12044 
833




834 


9430 


835 


_ 


836 


9461 


837 




839 


_ 


840 




841 


9544 


842 


9687 


843 


9255 


844 


9313 


845 




846 




847 


9446 


849


9663 


850 




851 


9707 


852 




853 


_ 


854 




855 


9290 


856 


9449 


857 


9848 


858 


9620 


859 




861 


9656 


862 




863 


_ 


864 


_ 


866 


- 


867 


9445 


868 




869 




870 




872 




873 




874 




875 


9880 


877 




878 




880 




882 


9693 


883 


9384 


884 




885 


_ 


886 


9669 


888 




889


_ 


890 


_ 


892 


_ 


893 


_ 


894 




895 




896 


9534 


898 


9435 


899 


9728 


900 




901 




902 




903 


9585 


905 




906 






12625 

10164 12130 

11276 13240 

11334 13046 

10770 

10745 

10539 

11191 13109 

11237 12822 
10008 

10699 11977 
9990 

10058 12387 

11202 12428 

10120 12297 

11445 12925 

9966 12648 
10801 
10558 

11220 12091 
10096 

10587 12674 
10356 

11630 12282 
12423 

10443 12375 
12078 

10432 12915 

10433 12857 
11888 12138 
11762 12966 
10962 12943 
11344 12053 
11271 13345 
11115 12547 
11352 12959 
11880 12727 
10103 12942 
11732 

10932 12396 

11538 12253 

10129 12462 

10699 12099 

10645 12617 
10083 
11567 

11863 12417 

10798 13305 

11871 12187 
9903 
11745 

10387 12627 

10232 

11198 

10691 12079 

10325 12711 
9896 

13085 

11559 12071 

11048 12615 

11312 12821 
12107 



907 - 11366 13133 

908 - 11210 

909 - 10928 12919 

910 - 10658 12963 

911 - - 12381

912 - 10085 12213 

913 9460 11645 

914 - - 13212 

915 - 11213 12217 

916 9314 

917 - 11326 13287 

918 - 10155 

921 - 10013 

922 - - 13228 

923 - 10617 12990 

924 - 10112 12211 

926 - 11750 

927 9466 11453 12723 

928 - 10966 12716 

929 - 11469 12748 

930 - 11898 

931 9411 10545 12871 

934 - 10082 12145 

935 - 9908 

936 - 10287 12541 

937 - 11294 12625 

939 - 11660 12667 

940 - 10885 13181 

941 - 10863 

942 9327 10618 12075 

943 9714 10044 

944 - 11358 12799 

945 - 11219 13274 

948 9455 11100 12176 

949 - 11891 12211 

950 - 11052 12615 

951 - 10010 

952 - - 12534 

953 - 11285 12413 

954 - 11682 

955 9384 11922 12756 

956 - - 13369 

957 - - 12700 

958 9774 11129 13311 
960 - 10155 

962 9858 11178 12392 

963 - - 12177 

964 - 10796 12870 

965 9337 

966 9455 11764 

967 - 11067 

968 9614 10508 13021 

970 - 10233 

971 - 11926 

972 - - 12506 

974 - 11534 

975 9623 10509 12295

976 9867 10340 12452 

977 - - 13216 

978 - 10191

979 - 10763 13357 

980 9538 10999 11976 
981


- 


982 


9506 


983 


_ 


984 


_ 


985 


9562 


986 


9837 


987 




988 


9789 


989 




990 




991 


_ 


992 




993 




994 


9241 


995 




996 




997 


9306 


998 




999 


9874 


1000 




1001 


9774 


1002 


9571 


1003 




1004 


_ 


1005 




1008 


_ 


1009 


9438 


1010 


9774 


1011 


9818 


1012 




1013 


9596 


1014 




1015 


_ 


1016 


_ 


1017 


9712 


1018 




1019 




1021 


9694 


1022 


9317 


1023 




1024 




1025 


_ 


1026 




1027 


9469 


1028 




1030 




1031 


_ 


1032 




1033 


_ 


1034 


_ 


1035 


_ 


1036 


_ 


1037 




1038 


9656 


1039 




1040 




1041 




1042 




1043 




1044 




1045 




1046 






9928 12646 
11126 13264 
12581 

10472 
10684 
11372 

11033 12581 
11283 12149 
10495 13317 
10250 12847 
10773 12162 
10130 13143 
11087 12344 
11068 12516 
12918 
12848 

10095 12993 
10506 

11410 12227 
11361 12620 
11129 13134 
11428 12102 
10775 12418 
11749 13009 
10848 

12047 

10419 13028 
11129 13134 
11436 13273 
10839 

9923 12445 
11330 13017 
11967 13360 
11636 

11761 12679 
12806 
12925 

10590 13042 
11965 12969
11971 

11187 12920 
11513 

12354 
12678 
12031 

11244 13238 
12450 

11462 12606 
10281 

10012 12064 
11177 12917 
11541 
10490 

10258 12375 
12580 

10348 13173 
10661 

11418 12865 
11336 12572 
10547 

11295 13248 
11809 12770 



1047 


- 


11627 


13054 


1048 


9393 


11701 


- 


1050 


- 


11042 


12710 


1051 


- 


11274 


13320 


1052 


- 


11124 


13176 


1053 


- 


10450 


12641 


1054 


- 


10379 


12735 


1055 


- 


11801 


- 


1056 


- 


11414 


12280 


1057


- 


11264 


12195 


1058 


- 


10963 


- 


1059 


- 


10488 


_ 


1060 


- 


10056 


_ 


1061 


9411 


10546 


_ 


1062 


- 


9884 


13248 


1064 


9636 


11736 


13249 


1065 


_ 


10902 




1066 


- 


10865 


12634 


1067 


9827 


10575 


12823 


1068 


_ 


_ 


13067 


1069 


9776 


11212 


12740 


1070 


9594 


10446 


12112 


1071 


- 


10476 


12015 


1072 


- 


10746 


- 


1073 


- 


11443 


12408 


1074 


- 


10292 


- 


1075 


9769 


11705 


- 


1076


9748 


10860 


12745 


1077 


- 


11320 


12720 


1078 


- 


10192 


12216 


1080 


- 


11470 


- 


1081 


9774 


10792 


13311 


1083 


9503 


11935 


12096 


1084 


- 


11898 


- 


1085 


- 


10136 


- 


1086 


- 


10081 


12841 


1087 


- 


11929 


- 


1088 


9617 


10111 


12642 


1091 


- 


10198 


12522 


1092 


- 


11164 




1093 


- 


10196 


- 


1094 


- 


11807 


12092 


1095 


- 


11214 


12255 


1096 


- 


10716 




1097 


9850 


11145 


12609 


1098 


- 


10253 


12851 


1099 


9333 


10133 


12465 


1100 


9490 


- 
Conclusion

In light of the detailed description of the invention and the examples presented above, 
it can be appreciated that the several aspects of the invention are achieved. 

It is to be understood that the present invention has been described in detail by way of 
illustration and example in order to acquaint others skilled in the art with the invention, its
principles, and its practical application. Particular formulations and processes of the present 
invention are not limited to the descriptions of the specific embodiments presented, but rather 
the descriptions and examples should be viewed in terms of the claims that follow and their 
equivalents. While some of the examples and descriptions above include some conclusions 
about the way the invention may function, the inventor does not intend to be bound by those
conclusions and functions, but puts them forth only as possible explanations. 

It is to be further understood that the specific embodiments of the present invention as 
set forth are not intended as being exhaustive or limiting of the invention, and that many 
alternatives, modifications, and variations will be apparent to those of ordinary skill in the art 
in light of the foregoing examples and detailed description. Accordingly, this invention is
intended to embrace all such alternatives, modifications, and variations that fall within the 
spirit and scope of the invention. 
What is claimed is:

1. An isolated nucleic acid molecule comprising a polynucleotide selected from the group
consisting of: 

a) any one of the nucleotide sequences selected from the group consisting of SEQ ID 
NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506; 

b) a functional portion of any of the sequences of a); 

c) a polynucleotide that is substantially similar to a sequence of a) or b); 

d) a sequence of at least 15 nucleotides that hybridizes under stringent conditions to a
polynucleotide of a), b) or c); 

e) the complement of any sequence of a), b), c) or d); 

f) the reverse complement of any sequence of a), b), c) or d); 

g) a polynucleotide encoding any polypeptide selected from the group consisting of SEQ 
ID NOs. 4132-8262, 8354-8444, and 8830-9214; and 

h) an allelic variant of any of the above. 

2. The nucleic acid molecule of claim 1, wherein said polynucleotide is present in a drought
tolerance QTL located on rice chromosome 3. 

3. The nucleic acid molecule of claim 1, wherein said polynucleotide is from a genomic region 
syntenic with a maize cold tolerance QTL. 

4. The nucleic acid molecule of claim 1, wherein said polynucleotide is present in a drought 
tolerance QTL on rice chromosome 8. 

5. The nucleic acid molecule of claim 1, wherein said polynucleotide encodes a protein 
containing a Universal Stress Protein A domain. 

6. The nucleic acid molecule of any of the preceding claims, wherein said polynucleotide is 
from a plant. 

7. The nucleic acid molecule of any of the preceding claims, wherein said polynucleotide is 
from a monocot or a dicot. 



8. The nucleic acid molecule of any of claims 1-6, wherein said polynucleotide is from a cereal.

9. The nucleic acid molecule of claim 8 wherein said cereal is selected from the group
consisting of maize, rice, wheat, barley, oat, rye, millet, milo, triticale, orchardgrass, guinea 
grass, sorghum and turfgrass. 

10. The nucleic acid of any of the preceding claims, wherein expression of said nucleic acid 
is altered 2X up or down in response to an abiotic stress. 

11. The nucleic acid of claim 10 wherein said abiotic stress is selected from the group
consisting of cold stress, salt stress, osmotic stress or any combination thereof. 

12. A vector comprising a polynucleotide of any of the preceding claims. 

13. The vector of claim 12, wherein said vector is a cloning vector or an expression vector. 

14. An expression cassette comprising a polynucleotide of any of claims 1-11.

15. A host cell comprising a vector or expression cassette of any of claims 12-14. 

16. The host cell of claim 15, wherein said host cell is a prokaryotic cell or a eukaryotic cell. 

17. The host cell of claim 16, wherein said host cell is a bacterial cell, an insect cell, a yeast cell,
a plant cell or an animal cell. 

18. The host cell of claim 17, wherein said host cell is a plant cell.

19. A plant comprising a vector or expression cassette of any one of claims 12-14. 

20. A plant comprising a host cell of any of claims 15-18. 

21. A seed from the plant of claim 20. 

22. A seed comprising a vector or expression cassette of any of claims 12-14. 
23. A plant produced from the seed of claim 22.

24. The progeny of a plant of claim 19 or 20.

25. A hybrid of a plant of claim 19 or 20. 

26. An isolated polypeptide comprising: 

a) any one of the amino acid sequences selected from the group consisting of SEQ ID 
NOs. 4132-8262, 8354-8444, and 8830-9214; 

b) a functional portion of a); 

c) an amino acid sequence substantially similar to any sequence of a) or b); 

d) an amino acid sequence of a)-c) wherein there has been at least one conservative 
amino acid substitution; 

e) an allelic variant of any of a)-d). 

27. The polypeptide of claim 26, wherein said polypeptide contains a Universal Stress Protein 
A domain. 

28. The polypeptide of any one of claims 26-27, wherein said polypeptide is from a plant.

29. The polypeptide of any one of claims 26-28, wherein said polypeptide is from a monocot or
a dicot.

30. The polypeptide of any one of claims 26-29, wherein said polypeptide is from a cereal. 

31. The polypeptide of claim 30 wherein said cereal is selected from the group consisting of
corn, rice, wheat, barley, oat, rye, millet, milo, triticale, orchardgrass, guinea grass, sorghum
and turfgrass. 

32. The polypeptide of any one of claims 26-31, wherein expression of said polypeptide is 
altered 2X up or down in response to an abiotic stress. 

33. The polypeptide of claim 32 wherein said abiotic stress is selected from the group
consisting of cold stress, salt stress, osmotic stress or any combination thereof.

34. A method for determining whether a test plant has been exposed to at least one abiotic
stress condition comprising, 

determining expression of a plurality of polynucleotides selected from the group
consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829 and 17505-17506 in said test plant
to produce an expression profile, and

comparing said expression profile of said test plant to an expression profile for the 
same plurality of polynucleotides in a reference plant exposed to at least one abiotic stress. 

35. The method of claim 34, wherein said expression profile is produced by 

isolating RNA from a plant; 

contacting said RNA with a plurality of probes which selectively hybridize to the 
expressed polynucleotides under stringent conditions; and 
detecting hybridization of said RNA to said probes.

36. The method of claim 35 wherein said RNA is mRNA.

37. The method of claim 35 or 36, wherein said RNA is converted to cDNA.

38. The method of any of claims 34-37, wherein the abiotic stress is selected from the group 
consisting of saline stress, cold stress, osmotic stress or any combination thereof. 

39. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 10 polynucleotides. 

40. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 25 polynucleotides. 

41. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises
at least 50 polynucleotides. 

42. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises
at least 100 polynucleotides.

43. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises
at least 250 polynucleotides. 

44. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 500 polynucleotides. 

45. The method of any of claims 34-38, wherein said plurality of polynucleotides comprises 
at least 750 polynucleotides. 

46. The method of any of claims 34-38, wherein said expression profile is accomplished 
using a microarray. 

47. The method of any of claims 34-46, wherein said plant is selected from the group 
consisting of maize, rice, wheat, barley, oat, rye, millet, orchardgrass, guinea grass, sorghum 
and turfgrass. 

48. A method for altering the tolerance of a plant to an abiotic stress comprising introducing 
into said plant a recombinant nucleic acid construct comprising at least one of the 
polynucleotides selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 
8445-8829, and 11374-17506. 

49. A method for altering the tolerance of a plant to an abiotic stress comprising introducing 
into said plant a recombinant nucleic acid construct comprising a polynucleotide of at least
15 nucleotides in length that hybridizes under high stringency conditions to the complement
of a sequence selected from the group consisting of SEQ ID NOs. 1-4131, 8263-8353,
8445-8829, and 17505-17506.

50. The method of claim 49, wherein said polynucleotide is operably linked to a 
heterologous promoter. 

51. A method for altering the tolerance of a plant to an abiotic stress comprising introducing
into said plant a recombinant nucleic acid construct, wherein said construct encodes a 
molecule which alters expression of an abiotic stress regulated coding region selected from 
the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506, or
the complement thereof. 

52. The method of claim 51 wherein said molecule comprises an antisense RNA, double 
stranded RNAi sequence, a triplexing agent, and a molecule containing a dominant negative 
mutation. 

53. A plant produced by the method of any of claims 48-52. 

54. A seed from a plant of claim 53. 

55. A method for selecting an agent that alters abiotic stress regulated polynucleotide 
expression in a plant cell comprising: 

contacting at least one plant cell with a test agent; 

subjecting said plant cell to an abiotic stress before, during or after contacting said 
plant cell with said test agent; 

obtaining an expression profile of said plant cell wherein said expression profile 
comprises expression data for at least one abiotic stress regulated sequence selected from the 
group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506;

comparing the expression profile of said cell to the expression profile obtained from 
at least one plant cell not exposed to said agent, but exposed to said abiotic stress; 

selecting said agent if said agent alters expression of said abiotic stress regulated 
sequence. 

56. The method of claim 55, wherein said plant cell is a cereal plant cell. 

57. The method of any one of claims 55-56, wherein said alteration in expression is at least
about two-fold (2X). 

58. The method of any one of claims 55-57, wherein said at least one abiotic stress regulated 
sequence comprises at least 10 sequences. 

59. The method of any one of claims 55-57, wherein said at least one abiotic stress regulated
sequence comprises at least 100 sequences.

60. The method of any one of claims 55-57, wherein said at least one abiotic stress regulated
sequence comprises at least 500 sequences. 

61. A computer readable medium having stored thereon computer executable instructions for
performing a method comprising, 

receiving data on expression in a test plant of at least one polynucleotide having at 
least 70% sequence identity to a nucleotide sequence selected from the group consisting of 
SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506, or the complement thereof;
and 

comparing expression data from said test plant to expression data for the same at least
one nucleic acid molecule in a plant which has been exposed to at least one abiotic stress. 

62. A computer-readable medium having stored thereon a data structure comprising, 
sequence data for at least one polynucleotide having at least 70% nucleic acid sequence 
identity to a polynucleotide selected from the group consisting of SEQ ID NOs. 1-4131, 
8263-8353, 8445-8829, and 17505-17506, or the complement thereof; and a module
receiving the nucleic acid molecule sequence data which compares the nucleic acid molecule
sequence data to at least one other nucleic acid sequence. 

63. An antibody to any of the polypeptides of claims 26-33. 

64. The antibody of claim 63, wherein said antibody is a polyclonal antibody or a 
monoclonal antibody. 

65. A method for identifying a homolog or ortholog to an abiotic stress responsive 
polynucleotide comprising: 

a) determining the nucleotide sequence of a plurality of isolated polynucleotides to
create a set of nucleotide sequences;

b) translating the nucleotide sequences in the set to derive one or more putative amino
acid sequences, based on one or more of the possible reading frames of the nucleotide
sequences and their complementary sequences;

c) selecting an amino acid sequence of an abiotic stress-responsive protein and
comparing the amino acid sequence of the abiotic stress-responsive protein with at least one
of the putative amino acid sequences;

d) identifying putative amino acid sequences having homology with at least a region 
of the amino acid sequence of the abiotic stress-responsive protein; and 

e) correlating putative amino acid sequences having said homology to translated 
nucleotide sequences of b). 


66. The method of claim 65, further comprising: 

f) assembling a plurality of the translated nucleotide sequences that encode the
putative amino acid sequences based on regions of overlap comprising at least 10 base pairs
of identical sequences between two translated nucleotide sequences to form one or more
nucleotide contigs;

g) translating the one or more nucleotide contigs into one or more amino acid contigs;

h) comparing the one or more amino acid contigs with the amino acid sequence of the
abiotic stress responsive protein to determine homology between the amino acid contig and at
least a region of the abiotic stress responsive protein; and

i) identifying at least one homolog of at least a region of the abiotic stress responsive
protein based on the homology determined in h).

67. The method of claim 65 wherein the abiotic stress responsive polynucleotide is selected
from the group consisting of SEQ ID NOs. 1-4131, 8263-8353, 8445-8829, and 17505-17506.

68. A method for altering the expression of a polynucleotide in response to an abiotic stress 
comprising operably linking said polynucleotide to a regulatory region selected from the 
group consisting of SEQ ID NOs. 13374-17504.
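
By way of illustration only, and not as a limitation of any claim, the translation step recited in claim 65 b) (deriving putative amino acid sequences from the possible reading frames of a nucleotide sequence and its complementary sequence) can be sketched in Python as follows. The sketch assumes the standard genetic code and an unambiguous A/C/G/T alphabet; the function names `reverse_complement`, `translate` and `six_frame_translations` are hypothetical and do not appear in the specification, and the example input is an arbitrary toy sequence rather than any sequence of the sequence listing.

```python
from itertools import product

# Standard genetic code, built from the conventional T, C, A, G codon ordering.
# Assumption: the standard code applies to the sequences being translated.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def six_frame_translations(seq):
    """Translate a sequence in three forward and three reverse reading frames."""
    frames = {}
    strands = (("+", seq.upper()), ("-", reverse_complement(seq)))
    for strand, s in strands:
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(s[offset:])
    return frames


if __name__ == "__main__":
    # Toy input chosen only to exercise the functions.
    for frame, peptide in six_frame_translations("ATGGCCTAA").items():
        print(frame, peptide)
```

Each putative amino acid sequence returned by such a routine could then be compared with the sequence of an abiotic stress-responsive protein, as in steps c) through e) of claim 65, using any suitable alignment tool; the comparison itself is not shown here.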